DECEMBER 


VOLUME 20 
NUMBER 6 


JOURNAL 
PSYCHOLOGY 


AMERICAN PSYC CAL ASSOCIATION 


| 
> 
“ee 

‘ 


JOURNAL OF CONSULTING PSYCHOLOGY 


Published by the American Psychologica! \ssociacion, Inc. 
at Prince and Lemon Srs., Lancaster, Pa. 


and 1333 Sixteenth St. N.W., Washingron, D.C, 


Edited by Laurance F. Shaffer 
Teachers College, Columbia University 


Advisory Editors: Donald W. Fiske, University of Chicago. Ann Garner, Uminersity of Tlinois. Harrison 
G. Gough. University of California. William A. Hunt, Nortnwester | niversity. E. Lowell Kelly, University of 
Michigan. Morris Krugman, Board of Education, New York, N. |. Secrtha M. Luckey, Board of Education, 
Cleveland, Ohio. Fred McKinney, University of Missouri. Anne Roc, Vew York, N. Y. Carl R. Rogers, University 
of Chicago. Nevitt Sanford, University ef California, Edward Jose) Shoben, Jr., Teachers College, Columbia 
University. George J. Wischner, University of Pittsburgh. J. R. Witten vorn, Rudgers University. 

Lorraine Bouthilet, Managing Editor 


The Journal of Consulting Psychology is the clinica! journal of the American Psychological 
Association. It is devoted primarily to the publication of »'iginal research relevant to psycho- 
logical diagnosis, psychotherapy and counseling, person.\ity, and the dynamics of behavior. 
Case studies, theoretical contributions, descriptions of «|inical techniques, and discussions of 
the professional problems of clinical psychology also appear 'n its pages. 


Editorial Office 
Manuscripts submitted for publication and correspondence on editorial matters should be 
sent to the Editor, Laurance F. Shaffer, 525 West 120th St., New York 27, N. Y. All manu- 
scripts should be submitted in duplicate to facilitate referral to the Advisory Editors. Original 
figures are prepared for publication; duplicate figures may be photographic or pencil-drawn 
copies. Authors are cautioned to retain a copy of the manuscript to guard against loss in the 
mail, 


Style of manuscripts. Manuscripts should be pre- 
pared in the style described in the Publication Manual 


port published elsewhere. For detailed instructions on 
how to prepare « Brief Report, see the back of the 


of the American Psychological Association (Psycho- 
logical Bulletin, 1952, 49, 389-449). The Manual may 
be purchased from the Washington office of the Asso- 
ciation for $1.00. Otherwise acceptable manuscripts 
which depart from the standards of the Manual are 
returned for revision and retyping. 


Brief Reports. A one-page condensation of an accept- 
able research report may be given early publication 
without charge if the author agrees (a) to make 100 
mimcographed copies of the full report available to 
those who request it, (6) to send a copy of the full 
report to the Editor for deposit in the American Docu- 
mentation Institute, and (c) not to have the full re- 


Contents page of this Journal or write to the Editor. 


Early Publication, Authors may secure the early 
publication of an acceptable article in the first issue 
to go to press after its acceptance by arranging for 
the article to be printed as ertra pages, increasing 
the number of pages that subscribers receive. The 
charge to the author for such carly publication is 
$20.00 per printed page. 


Reprints. Authors receive fifty reprints gratis of all 
articles except early publication articles and Brief 
Reports. Additional quantities may be ordered when 
returning proof. 


Business Office 
Communications concerning subscriptions, change o{ address, claims for the nonreceipt of 
a number, advertising, and other business matters, shou! be sent to the American Psychological 
Association, Inc., 1333 Sixteenth St. N.W., Washington 6, D.C. Address changes must reach 
the subscription office by the 10th of the month to take «''cct the following month. Undelivered 
copies resulting from address changes will not be repia::|. subseribers should notify the post 
office that they will guarantee second-class forwarding stage. Other claims for undelivered 
copics must be made within four montlas of publication. 


The Journal of Consulting Psychology is published »:monthly in February, April, June, 
August, October, and December, the yearly volume com \r sing approximately 480 pages. The 
subscription per year is $8.00, foreign $8.50, single num!) $1.50. 


Entered as second-class matter, March 21, 1955, at the Office at Larcoster, Pa., aider the act of March 3, 1879. Accept- 
ance for mailing at special rate of postage provided for in lon $38, Act 0! February 2$, 1925, authorized May 27, 1940. 


Copyright © 1956 by the American Psycholo: «© Association, Inc. 


| 
a 
; 
| 


JQU RNAL OF 
CO: NSULTIN G 


December, 1956 


Contents 


Second-Order Personality Factors in the Questionnaire Realm: Raymond B. Cattell - - 
Factors, Response Bias, and the MMPI: Eugene H. Barnes - - - - - - - - = = 


The Personality of Judges and their ee with wasnt in Judging Clinical Case 
Histories: A.W. Bendig - - - - - = 


Comparison of Two Methods of Electroshock in Their Effect on Cognitive Functions: 
Barbara S. Kendall, Warren B. Mills, and Thomas Thale- - - - - - - - - - 


Group Psychotherapy with Chronic Psychotic Soiling Patients: John E. Tucker - - - - 


The Short-Term Effects of Chlorpromazine on Psychiatric Patients: William D. Winter 
and Wilbur K. Frederickson - - - - = = © s+ ee 


Use of the Proverbs Test for Differentiating Schizophrenics from Normals: Donald R. 


The Ability of Schizophrenics to in an Situation: M. 
David Diamond - - - - 


Concept Learning in the Educationally Retarded Child of Normal eae Nissim 
M. Levy and Joseph M.Cuddy - - - - - - - ----+-+-+-++--2-- 


The TAT and Antisocial Behavior: Kenneth Purcell- - - - - - - - = = = = = 
Situational Influences on Rorschach Responses: Edith M. Henry and Julian B. Rotter- - 


Affective Control and Rorschach Human Movement Responses: Julian Melizoff and 
Dorothy Litwin - - = = +--+ ++ = 


The Use of the Kaleidoscope as an Adjunct to the Rorschach: Emanuel Starer - - - - 


The Use of Human Figure Drawings in the Diagnosis of —_ nT: Marvin 
Reznikoff and Donald Tomblen - - - - - - - = = ae 


Certain Formal Characteristics of Human Figure Drawings by Institutionalized Aged 
and by Normal Children: Martin Lakin - - - - - - = = = = = = = = = 


Cue-Counting: A Measure of Anxiety in Interviews: Andrew S. Dibner - - - - - - 
Test Anxiety and Intelligence Test Performance: J. Zweibelson - - - - - - - - = 


The Relation of the Children’s Manifest Anxiety Scale to the Concept of ey as 
Used in the Clinic: Robert D. Wirt and William E. Broen, Jr. - - - - - - - 


The “Alienation Syndrome” and Adler’s Concept of “Distance”: H. L. Ansbacher - - - 


Some Comments on Ansbacher’s Note on the “Alienation Syndrome” and Adler’s Con- 
cept of “Distance”: Anthony Davids - - - - - - - - --+-+-+-+-+--- 


Psychological Test Reviews- - - - - - 
Volume Title Page and Contents Appear at the End of this Issue 


Vol. 20, No. 6 


411 
419 


422 


423 


431 


ip 
4 
2 
435 
441 
| | 
457 
463 
471 ig 
475 
479 
482 
485 
| | 
4 


Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
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D. C., remitting in advance $—— for microfilm or 
$—— for photocopies. Make checks payable to Chief, 
Photoduplication Service, Library of Congress. 
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duplication by the ADI. 
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Second-Order Personality Factors in the 
Questionnaire Realm 


Raymond B. Cattell 
Laboratory of Personality Assessment and Group Behavior, University of Illinois 


Although there has been much emphasis on 
the potential importance of broader, second- 
order personality factors, by Thurstone and 
others, and even some claims to recognizing 
particular second-order factors by Eysenck 
(20) and the present writer (2, 4), there 
has not existed until this year any very firm 
basis of evidence for conclusions. The present 
writer’s earlier attempt (4) was admittedly 
premature, while Eysenck’s extensive argu- 
ments for a second-order neuroticism factor 
and “extraversion” factor (20) seem partly 
based on the misapprehension, mentioned be- 
low, that second-order factors can be reliably 
obtained from factoring correlations among 
the actual scores on the existing measures for 
primary factors. There are, indeed, three 
rather widespread misapprehensions, dealt 
with below, which may confuse research on 
second-order factors. 

A second-order factor is a factor among 
oblique primary factors. That is to say, it is 
obtained by factoring a matrix of correla- 
tions among pure measures of primary fac- 
tors and rotating to simple structure. The 
second-order factors will be less numerous 
than first orders, or “primaries,” and will 
tend to cover wider areas of behavior. For ex- 
ample, the general ability factor, on which 
the IQ is based, covers more than the indi- 
vidual primary abilities of Thurstone, from 
which it is derived as a second-order factor 
(30). 

The theoretical and practical value of sec- 
ond-order factors, relative to first-order fac- 
tors, is an issue with partisans on both sides. 
In the theoretical realm some, like Hsii (25) 
see second-order factors as “super factors,” 
more permanent and more deeply rooted in 
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physiological or sociological fact than first- 
orders. Others (6, 24, 33) consider the pri- 
maries as the psychologically-real concepts 
and second-orders only as more transient or- 
ganizing principles among them. The former 
is illustrated by the theory that g is a func- 
tion of the total effective number of cortical 
neurones, but that the primary abilities 
merely represent particular training invest- 
ments of this general relation-perceivipg ca- 
pacity, developed in particular sensory or 
symbolic areas. The latter, ie., the alterna- 
tive interpretation, is illustrated by the pres- 
ent writer’s earlier explanation that the ob- 
served organization of certain personality 
factors (2) represents only the population 
correlation produced among them by selec- 
tion according to the social status conditions 
of the particular country concerned. (See also 
Fisher, 21.) 

Regardless of theory, the practical psychom- 
etrist can predict test performance either 
from first- or second-order factors. But here, 
too, there are preferences, as shown, for ex- 
ample, by the British predilection for work- 
ing with Spearman’s g—the second-order fac- 
tor in abilities (33)—and the American 
preference for primary abilities. If we rule 
out specifics as too numerous to be useful, 
prediction from primaries accounts for the 
greatest possible actual test variance, but 
from second-order factors there is a wider 
field of prediction, in terms of one factor (6, 
30). 


Second-Order Factoring 


For the psychologist, as contrasted with the 
statistician, the importance of factors lies be- 
yond immediate prediction. It concerns isola- 
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tion of psychological concepts and the devel- 
opment of psychological laws and insights, 
which extend the understanding of behavior 
beyond the immediate, static, measurement 
situation. Here, both the first- and the sec- 
ond-order structure are parts of the total 
necessary knowledge, as will be exemplified 
in the present study. However, as indicated 
in the introductory remarks, the second-order 
factors must for this purpose meet certain 
conditions not sufficiently recognized in some 
current studies. 

In the first place, it does not suffice to take 
any existing test or battery measuring the 
factors, to intercorrelate the factor scores, 
and analyze. Today, and for many years to 
come, the individual factor measures will be 
substantially impure. Each will contain, in 
addition to the inevitable specific, some con- 
tamination with other factors present in the 
battery. The primary measures may, also, in- 
clude in common some factors, e.g., response 
sets, which have nothing to do with the 
intended primary measures per se. Conse- 
quently the factoring of these actual primary 
factor test scores, presuming that each is split 
in two, or increased in number in some way 
to give a sufficiency of variables for deter- 
mining the number of common factors in- 
volved, will yield the primaries again, to- 
gether, perhaps, with the spurious common 
factors among them due to dimensions ex- 
traneous to the primary dimensions them- 
selves, e.g., to the response format of the 
battery. 

This theoretical expectation has actually 
been verified in Thurstone’s (31) and Lovell’s 
(27) work on Guilford’s factors (24), where, 
if the battery for each factor is split, and 
each factor reliability is put in the matrix, 
the analysis is shown to yield the primaries 
again with tolerable precision. This result will 
be readily appreciated by anyone who has 
seen the complete discrepancies between the 
correlation between the raw scores of any 
two factor measures and the correlation found 
from measuring the cosine of the angle be- 
tween the factors on their simple-structure 
plot. The former, dependent on a cluster of 
items, is pulled all over the place by the ad- 
dition of fresh items, with their particular 
contaminations; but the latter remains firm. 
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For the hyperplane is fixed by variables in 
many dimensions that are not loaded in the 
factor in question. However, it must be 
granted that where factors are already very 
purely measured by existing tests, with no 
systematic bias and with effective suppres- 
sion of peer factors in each factor score, the 
factoring of actual factor scores, i.e., of one 
variable for each factor, may yield a reason- 
able approximation to second-order factors. 
This seems to have happened with the 16 
P.F. test in the large Air Force study (8). 

However, the matrix with which a second- 
order factoring properly begins is the cosine 
matrix among the simple-structure primary 
factors. Few oblique rotations to simple struc- 
ture undertaken in the usual course of deter- 
mining the form of the primaries are accurate 
enough for a second-order factoring. Indeed, 
as French’s (22) and other surveys show, 
perhaps a half of all published factor analy- 
ses falter and fall short of their goal—even 
the goal of defining the first-order factors— 
for lack of thorough search for simple struc- 
ture. Even the last couple of over-all, “polish- 
ing” rotations, in the usual series of ten to 
twenty over-all rotations from the original 
position, will substantially alter the number 
and nature of second-order factors that would 
be obtained. 

One needs also to be warned against ab- 
sent-mindedly factoring—as the writer has 
done before (4)—the matrix of cosines thus 
achieved, for these are the correlations among 
the reference vectors. Such a matrix must be 
inverted to give correlations among the fac- 
tors. Thus when, as in the usual notation, A 
is the obtained transformation matrix, pro- 
ceeding from the extracted factor matrix, V,, 
to the best simple structure matrix, V,, the 
required square matrix for second-order fac- 
toring is 

*D 


where D is a diagonal matrix the purpose of 
which is to normalize by rows and columns 
the obtained inverse (6). In summary, the 
three pitfalls to be avoided in second-order 
factoring are: taking primary correlations 
from raw scores instead of the simple struc- 
ture plot; using reference vector correlations 
instead of factor correlations; and, as dealt 
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with below, leaving second-order factors or- 
thogonal instead of seeking simple structure 
for them also. 


The Questionnaire Foundation 


The number of primary personality factors 
in questionnaire responses is still not fully 
determined, but the recent studies of Guil- 
ford (24) and of the present writer (10, 11, 
12) converge in indicating that (among 
adults) about fifteen will account for most 
of the variance in highly varied collections 
of items. In the case of the author’s studies 
and those of his research associates, the close 
coordination maintained between question- 
naire and observer behavior-rating factors 
brings additional confirmation, in that some 
fifteen factors have also been found in the 
latter (3, 12). 

The fifteen primary personality factors on 
which the present study is based have since 
1949 been embodied in the Sixteen Person- 
ality Factor Questionnaire (15), into which, 
for general utility, general intelligence has 
been built as the sixteenth factor. The factor 
structure of the 432 items (466 less 34 in- 
telligence items) comprising the A, B, and C 
forms of this test has been verified in three 
studies, each experimentally, statistically, and 
rotationally independent (5, 10, 11). A wide 
breadth of behavior reference in the 16 P.F. 
items has been guaranteed by the historical 
survey (3) and by the concept of sampling 
the personality sphere made in the first analy- 
sis (5), as well as by examination of 2,002 
further items in the two succeeding studies 
(10, 11). 

Hypotheses as to what the fifteen factors 
may be which so persistently appear have 
been put forward (3, 15, 22) and debated. 
However, it is only recently, with the return 
of external validation results from the 16 P.F. 
Test (9, 14, 15, 34), in clinical, educational, 
and industrial use, that these concepts have 
been evaluated and sharpened. For example, 
the examination of the degree of inheritance 
of these factors made on their equivalent 
forms in the Junior Personality Quiz (13) 
with eleven-year-old twins, siblings reared 
apart, etc. (14), has thrown much light on 
their basic natures. From these studies, it ap- 
pears that 2 of the 15 factors (A and #) 


are hereditary components of the cyclothyme- 
schizothyme temperament continuum; that 
others, e.g., surgency-desurgency (F), and 
sensitivity (J), are almost wholly environ- 
mentally determined; that the factors C and 
G correspond to the clinical concepts of ego 
strength and superego strength; that EZ, L, 
and Q,; correspond to the concepts of domi- 
nance, paranoid trend, and radicalism, and 
that O and Q, characterize two relatively dis- 
tinct components in anxiety. More precise 
discussion can be given to these meanings of 
primaries where it becomes relevant to the 
interpretation of the second-order factors 
among them found here. 


Correlations Among Primaries 


Determination of the cosines (correlations) 
among the fifteen primary factors was based 
on the third (10) of the above studies on 
the 16 P.F., which had benefited from the 
preceding studies in acquiring more highly 
loaded items, and, therefore, a clearer factor 
structure. The correlation matrix was of or- 
der 75 X 75 and consisted of 2,775 product- 
moment correlations on an N of 408 subjects 
(181 undergraduate men and women stu- 
dents and 227 Air Force young men in train- 
ing). Although the 16 P.F. test was repre- 
sented by only 75 variables it will be seen by 
reference to the studies indicated (10, 11) 
that these truly represent the factor structure 
of the whole test. The special procedure of 
parcelled factor analysis, by which the 163 
highest loaded items in the test are parcelled 
into 75 short scales of two and three items 
each (because of the impossibility of handling 
a 163 X 163 matrix) has been described else- 
where (11). It gives the test as a whole the 
structure of the 75 parcels, and evaluates each 
item in the total 16 P.F. against the factors 
established in this framework. 

For purposes of determining the angles 
among oblique factors with great exactitude, 
in order to enter a second-order factor analy- 
sis, this design of having each factor loaded 
on only 5 out of 75 variables is particularly 
valuable. For the even distribution (each fac- 
tor represented by the same number of vari- 
ables) and the great preponderance of vari- 
ables in every hyperplane (70 out of 75 is 
possible, and was closely approached in our 
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Table 1 


Cosines Among Reference Vectors 
Primary Personality Vectors 


A © ® © ® O O OQ 


(G) 00 04 —02 05 1.00 

(H) —36 —12 —18 —.26 00 1.00 

(1) —29 —04 -06 O01 00 14 1.01 

(L) —01 —06 -—11 —04 —.04 1.00 

(M) 0 14 06 26 —04 05 —14 -07 99 

(N) 08 O01 —11 —12 43 —05 .26 —05 —.02 1.01 

(O) 02 19 00 02 13 —.11 —08 04 1.00 

-15 05 —31 -02 11 -09 02 -19 OO 1.01 

(Qs) 07 -01 18 22 02 03 —11 —01 —11 03 08 —.14 1.01 
(Qs) 30 -19 12 22 —.29 -—02 11 —04 —22 .20 00 .12 1.00 
(Qs 0 0 02 O08 —-05 15 —15 —07 —17 03 O1 .23 .99 


study) permits a very sure perception of hy- The simple structure was actually approxi- 
perplanes and their angles. Actually the mean mated with relatively little labor by applying 
percentage of variables obtained in the aver- Wrigley’s quartimax method (28). This gives 
age hyperplane (between + .10 loadings), for the nearest approach by an analytical method, 
all factors, came to 91 per cent, which is and within a Procrustean orthogonal struc- 
larger than in any other published or unpub- ture, to the eventual oblique structure. How- 
lished study known to the writer. Secondly, ever, since this analytic method stops short 
the determination of each hyperplane by as_ of the best simple structure, by tending to 
many as fourteen factors, i.e., in fourteen- leave too much variance in the centroid and | 
space, precludes any serious distortion of an- through failing to follow the natural obliqui- 
gles from the composition of the particular ties, rotation was finished “by hand.” Graphi- | 
cluster of variables which might happen in cal plots of each factor with every other (105 

this study to define, i.e., to be highly loaded plots per rotation) were made. After four ro- 
in, any given factor. tations of all factors from the quartimax po- 


Table 2 


Correlations Among Factors 
Primary Personality Factors 


© © ® ® ® W O QO OQ QW 


(Qi) —.16 —.03 31 —03 —.11 08 —.05 —.02 18 —.03 
d : P 04 11 
(Q:) -—32 47 —03 —.14 18 42 —.24 —39 —.08 04 —.11 
Q) 03 —31 —01 02 —02 —28 -03 33 09 -05 38 —04 03 —.42 


| 
(A) 1.01 
(C) 13 1.00 
(E) 07 06 1.00 
(F) -01 0S 00 99 
(A) 
(Cc) —.19 
(E) —02 —.04 
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sition, a very clear structure was obtained. Table 3 
A fift h “polishing rotation wae carried out Rotated Second-Order Factor Matrix 
to give the best possible determination of 
angles. The mean hyperplane content indi- Prima 
cated above, 91 per cent, was thus obtained, Perf F F: F; F, 
and the significances of the simple structure 
determinations, tested by Bargman’s formula A +07 +45 +.56 —.10 
Cc —.53 +.10 — 25 —.10 
(1), was found to be beyond p = .001 for E +.02 _ 01 +04 +50 
each of the fifteen factors. F +.17 +.48 +.04 +.21 
The cosine matrix, C = A’A, among the ref- G — .09 — 01 — .08 —.23 
erence vectors at this point is given in 
Table 1 and its inversion and normaliza- L 
tion to the correlations among the factors, M +.07 — 51 +.13 +.04 
K = DCD, in Table 2. It will be observed N —.08 —.09 —.48 +.34 
that as usual the intercorrelations among the O +.62 — 07 +.16 — 05 
factors are higher than among the vectors. 
Q; —.66 —.01 —.29 —.02 
Qa +.62 —.09 —.09 +.00 
% | 60 
L Note.—The unrotated matrix (the quartimaxed matrix, Voo, 
50 from which the transformation matrix operates) and the trans- 
formation matrix by which this table was obtained have been 
40 deposited with the American Documentation Institute. Order 
Document No. 4969 from ADI Auxiliary Publications Project, 
30 Photoduplication Service, Library of Congress, Washington 25, 
. C., remitting in advance $1.25 for microfilm or $1.25 for 
photocopies. Make checks payable to Chief, Photoduplication 
20 Service, Library of Congress. 
EJ second-order factors. Simple structure may 
8° here be harder to determine without am- 
*Q, ‘}o biguity, because of the fewer variables, but 
20 enough second-order studies now exist (12, 
30 25, 30, 33) to show that simple structure 
as commonly resides in the data and gives psy- 
chological meaning to factors thus uniquely 
*c determined. Accordingly, the second-order 
60 


Q,° 


Fig. 1. Simple structure in the second-order realm: 
F, plotted against Fo. 


The Second Order Factors 


On the factor extraction from Table 2, 
carried out by the centroid method, Tucker’s 
criterion (6) indicated four or five factors. 
Five were extracted, but subsequent rotation 
clearly collapsed one of them. One of the 
sources of confusion in second-order factor 
solutions, mentioned above, is that experi- 
menters who rotate primaries for oblique sim- 
ple structure nevertheless sometimes adopt 
orthogonal, unrotated, centroid interpretations 
for second-order factors. The well-known rea- 
sons for oblique simple structure in primaries 
(6, 30) actually apply just as forcibly to 


factors were rotated, and, after seven rota- 
tions, the structure shown in Table 3 was 
found. From the typical graph in Fig. 1 it 
will be evident that this is a very clear struc- 
ture and practically unimprovable. (Possibly 
a 5° shift could still be made on each factor.) 
The number of variables in the hyperplane— 


Table 4 


Correlations of Second-Order Factors 
(Simple structure after seven rotations) 


Factor 1 2 3 4 
1 1.02 
2 — .02 1.21 
3 15 —.29 1.09 
4 02 43 —.13 1.14 


| 
| 
| 
| 
| 
| 


416 Raymond B. Cattell 


slightly more than half—is about that found 
in most satisfactory first-order structures. 

The intercorrelations of second-order fac- 
tors (D(A’A)“D), derived from the trans- 
formation (Lambda) matrix are shown in 
Table 4. 

The chief indicators of the nature of Fac- 
tors 1 through 4 in Table 4 will now be set 
out verbally, a contingent title being given to 
each factor. 


Factor 1. Anxiety-vs.-Dynamic 


Symbol Integration Loading 
Q:(—) Lack of will control! .66 
O(+) Insecurity or free-floating anxiety 62 
Qu(+) Nervous tensions (somaticanxiety)  .62 
L(+) Paranoid trend 54 
C(—) Lack of ego strength .53 


This factor has clearly brought together 
the “pathological” primaries. Before turning 
to interpretation one may point out that the 
evidence on second-order factors among be- 
havior-rated personality primaries, which is 
published synchronously (12) with this ar- 
ticle, shows a factor similar in all respects 
except that Factor I, called “Premsia” (12), 
is included there. Three theories compete for 
acceptance here: first, that it is a factor of 
general anxiety; second, a factor of general 
neuroticism; and third, that it is the differ- 
ent form of personality disorganization asso- 
ciated with psychoticism (20): see also Fac- 
tor U.I. 25 in (7). The last has too many 
speculative elements and the first is somewhat 
weakened by the absence of the third main 
form of anxiety—desurgency (F —) or anx- 
iety depression. On the other hand, it is strik- 
ing that this set of five primary factors is 
identical with that found to correlate sub- 
stantially with (and only with) the objective 
test factor (7) of Anxiety (indexed U.I. 24). 
(Both patterns also show some low H factor.) 
Nevertheless, considering the immediate na- 
ture of the above primary factors alone, one 
might incline to an interpretation of the sec- 
ond-order factor as Dynamic Integration-vs.- 
Neuroticism. There stands in the way of this 


1 The titles used are those of the 16 P.F. Hand- 
book (15). To make easy, consistent reading, all 
loadings have been made positive, by reflection, in 
these factor lists. The reflection is indicated by the 
sign on the left and the change of title to the op- 
posite pole, eg., “Lack of... .” 


interpretation Eysenck’s interpretation of U.I. 
23 as the neuroticism factor (20), an obstruc- 
tion qualified somewhat by our reinterpreta- 
tion of U.I. 23 as reduced neural reserves, 
rather than neurotic conflict per se. On the 
other hand, U.I. 24, with which the present 
second-order factor apparently identifies it- 
self, is better described as anxiety than as 
conflict. Accordingly, since reliable descrip- 
tion is more important than interpretation at 
this stage, it would seem best to compromise 
by naming the present second-order question- 
naire factor Dynamic Integration-vs.-Anxiety. 


Factor 2. Extraversion-vs.- 


Symbol Introversion Loading 
M(-—) Practical concern or praxernia (12) 

(vs. Bohemianism oralaxia[12]) 
F(+) Surgency 48 
A(+) Cyclothymia 45 
Conservatism 42 
Q2(—) Lack of self-sufficiency 38 
H(+) Adventurous cyclothymia or parmia 

(12) 33 


Since the parallel study in rated behavior 
factors is only now appearing (12), it may 
be stated for the reader’s information that a 
second-order factor (F4) resembling this is 
found therein, combining high F and M(—) 
with high D (not entering into questionnaire 
experiments). There, E factor, dominance, 
also enters in, and it is noteworthy that an 
alternative rotation considered here for Fac- 
tor 2 on 4, reducing their correlation, would 
also bring it in here. A second confirmation 
of the pattern is found in that U.I. 32 among 
objective tests correlates significantly (p = 
.01) with Factors F, A, Qe, H and E; though 
M and Q, are not as significant. These are 
thus, once more, quite independent indica- 
tions of the same pattern. 

This pattern has pulled together two pri- 
mary factors which have been long recog- 
nized as cyclothyme-schizothyme in charac- 
ter, but it emphatically adds others which 
might go under the broader concept of ex- 
traversion—such as surgency, dominance, 
practical concern and lack of self-sufficiency. 
Along with conservatism these point to a 
person closely identified with the group life— 
“a good fellow”—as opposed to the independ- 
ent recluse. The relatively low loadings of the 
more hereditarily determined H and A fac- 
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tors (14) suggest that Factor 2 is an envi- 
ronmentally determined pattern. There now 
seems ground for recognizing extraversion-in- 
traversion as something more than a correla- 
tion cluster or surface trait, and perhaps for 
specializing the term for this measurable 
second-order factor. 


Factor 3. Cyclothyme-vs.- 


Symbol Schizothyme Constitution Loading 
A(+) Cyclothymia 56 
I(+) Emotional sensitivity or premsia 

(12) 53 
N(-) Lack of sophistication A8 
Q3(—) Lack of will control .29 
C(—) Lack of ego strength 25 


Again, a somewhat similar factor is found 
in the second-order among behavior-rated fac- 
tors, in No. 5 (12), which links cyclothymia 
with low ego strength. A constitutional, body- 
build hypothesis was there suggested for ex- 
periment. Although only preliminary results 
are available it seems also to be the pattern 
(high A, J, low N) distinguishing manic- 
depressives from normals (15), and has fair 
resemblance to that distinguishing women 
from men (15) (but women have higher 
manic-depression and lower schizophrenia 
rates, as is well known). A constitutional 
body-build factor connected with cyclothymia 
is hypothesized. 


Factor 4. Unbroken Success-vs.- 


Symbol Frustration Loading 
Q:(+) Radicalism 55 
E(+) Dominance 50 
N(+) Sophistication 34 
G(—) Low super ego strength .23 


Again it is noteworthy that a second-order 
factor (Factor 2) among observer-rated be- 
havior factors (12) presents a pattern agree- 
ing excellently with this in the variables com- 
mon to the two studies (Z, N, and G). The 
present study adds Q; and the objective be- 
havior study adds high intelligence. Possible 
hypotheses are “easy personal success” occa- 
sioned by good intelligence or good educa- 
tional opportunity, or “social status,” which 
is known to be correlated with intelligence 
and dominance (2), and could well be cor- 
related with sophistication and radicalism, in 
young men. Incidentally, the reader is re- 
minded of the evidence that “radicalism,” as 
a factor, is more than an accident of politi- 


cal belief, and constitutes a general tempera- 
mental source trait (19, 29). 

Doubtless a generous native endowment in 
abilities plays a large causative role in this 
syndrome. One is reminded of a famous am- 
bassador’s comment on a British Secretary 
of State, known for his radical innovations, 
dominance of the world scene, sophistication, 
and debauchery (low G/): “his capacity is 
on so high a level that he has no need to 
give up his pleasures in the slightest degree 
for any cause” (17, p. 478). But in the life- 
record factor (12) the intelligence loading is 
scarcely high enough to make it the essential 
root, and the loadings are similar to those for 
social status. Consequently it is safer to con- 
sider this as the organization within person- 
ality brought about by unbroken success, 
from whatever cause, as opposed to frustra- 
tion and intellectual bafflement. Among stu- 
dents it should correlate negatively with age. 


Summary 


1. A questionnaire—the 16 P.F.—broadly 
based on the personality sphere and over 
2,000 items, had its factor structure redeter- 
mined on a sample of 408 young men and 
women, with particular attention to exact ro- 
tation to the best simple structure. 

2..The correlations thus determined, i.e., 
by inversion of the reference vector matrix, 
among the 15 confirmed primaries (intelli- 
gence was omitted) factored to four second- 
order, simple-structure factors. 

3. These four factors have resemblances, 
ranging from fair to excellent, to four second- 
order factors found from life-record (behav- 
ior rating) primaries, and two of them have 
closely similar profiles to the correlations 
found between questionnaire primaries and 
two objective personality test factors. The 
latter seems to confirm the surmise, made on 
general principles, that objective test factors 
would tend to be on a different level from 
rating and questionnaire factors, and, gener- 
ally, to be second-order to them. 

4. The factors have been contingently 
named Dynamic Integration-vs.-Anxiety (Ob- 
jective test factor: U.I. 24); Extraversion- 
vs.-Introversion, (Objective test factor: U.I. 
32); Cyclothyme-vs.-Schizothyme Constitu- 
tion; and Unbroken Success-vs.-Frustration. 
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There is a significant positive correlation here 
of the second and fourth, as also was found 
in behavior rating second-order factors (12). 
Definite hypotheses are proposed for their 
further experimental investigation. As a part 
of such investigation it is desirable to score 
individuals on the 16 P.F. not only for pri- 
maries but also for these four second-order 
factors (from the above loadings) to see if 
higher correlations with a specific criterion are 
associated with second-order than with first- 
order factors. Dynamic Integration scores 
should be of particular interest to the cli- 
nician. 

Received January 24, 1956. 
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Factors, Response Bias, and the MMPI 


Eugene H. Barnes 
VA Hospital, Downey, Illinois 


Certain deductions concerning the influence 
of response set on the MMPI were tested in 
a recent article by Barnes (1). These deduc- 
tions were derived from Berg’s deviation hy- 
pothesis (2) and other pertinent research. 
This hypothesis asserts: “Deviant response 
patterns tend to be general; hence those devi- 
ant behavior patterns which are significant 
for abnormality and thus regarded as symp- 
toms, are associated with other deviant re- 
sponse patterns which are in noncritical areas 
of behavior and which are not regarded as 
symptoms of personality aberration” (2, p. 
62). Since the scoring system of the individual 
form of the MMPI provides that only those 
items answered in the infrequent direction are 
recorded, the MMPI provided a convenient 
source of deviant responses as defined by 
Berg. In his article Barnes (1) presented cor- 
relations of the MMPI scales with the num- 
ber of items answered atypically true and 
with the number answered atypically false. 
He found the tendency to deviate by answer- 
ing false to be more highly correlated with 
the scales of the neurotic triad (Hs, D, Hy) 
and the tendency to deviate by answering 
true to be more highly correlated with the 
scales of the psychotic triad (Pa, Pt, Sc). 
Further, the two tendencies were not signifi- 
cantly correlated with one another. 

A factor analysis of the MMPI scales 
based on the scores of 110 neuropsychiatric 
patients was reported by Wheeler, Little, and 
Lehner (6) in which two major factors and 
one minor factor were identified. Factor I, 
the “psychotic” factor, had its maximal load- 
ing on Pt and Sc. This factor also had a sub- 
stantial negative loading on K. Factor II, the 
“neurotic” factor, had its maximal loadings 
on the scales of the neurotic triad. 


The correlations reported by Barnes (1) 
are sufficiently similar to the factor load- 
ings on the two major factors reported by 
Wheeler, Little, and Lehner (6) to suggest 
that the sets may be empirical correlates of 
the rationally derived factors. Specifically, it 
is hypothesized that, if included in the cor- 
relation matrix, the vector representing num- 
ber of true answers would be approximately 
collinear with Factor I, while the number of 
atypical false answers would be approximately 
collinear with Factor II. 


Procedure 


A direct test of this hypothesis does not 
appear possible since different samples are in- 
volved. An indirect method is to assume the 
existence of a test whose vector is collinear 
with the reference vector, calculate the cor- 
relations of each test in the matrix with the 
hypothetical test, and compare these calcu- 
lated estimates with the obtained correlations 
of the test under investigation. In order to 
make these calculations it is necessary only 
to know the communalities of the tests in the 
original matrix, the cosine of the angle each 
makes with the reference vector, and the com- 
munality of the hypothetical test (5). Re- 
fined estimates of the communalities of the 
tests in the matrix are provided by the factor 
analysis. The cosine of the angle each test 
makes with the reference vector is simply its 
factor loading divided by the square root of 
the communality. The communality of the 
hypothetical test must be estimated. In this 
predicament it seems logical to use the same 
approximation recommended for such esti- 
mates in the original factor problem, i.e., the 
highest correlation obtained between the sus- 
pected collinear test and any of the other 
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Table 1 


Correlations of Atypical Responses and Hypothetical 
Tests with Scales of MMPI 


Hypo- Hypo- 

Atypically thetical Atypically thetical 

Scale false Test II true Test I 
L 37 — —.24 — 
F 43 .66 .64 
K .06 — .69 —.67 
Hs 63 52 50 
D 73 55 34 46 
Hy 59 72 —.09 
Pd .28 .68 .67 
Mf .28 36 
Pa 37 — 52 .60 
Pt 31 84 .89 
Se 36 .90 88 
Ma 21 59 


tests in the matrix. In order to obtain esti- 
mated correlations using this approximation it 
is merely necessary to compute the scalar 
product of the two vectors, which turns out 
algebraically to be the square root of the 
estimated communality multiplied by the fac- 
tor loading of the test with which its corre- 
lation is to be inferred. The communality of 
the test suspected to be collinear with Factor 
I (atypical true answers) is estimated at .90, 
its correlation with Sc. The communality of 
atypical false answers is estimated at .73, its 
correlation with D. 


Results 


The correlations expected for the hypo- 
thetical collinear tests as compared with the 
correlations obtained by Barnes (1) are pre- 
sented in Table 1. It will be noted that esti- 
mates are not made for all tests on both fac- 
tors. This is because factor loadings were not 
reported on these tests by Wheeler, Little, 
and Lehner (6). It is presumed that these 
omitted factor loadings were not considered 
significant. This would be consistent with the 
low predicted correlations. 

It is obvious that the predicted correlations 
for a hypothetical test collinear with Factor I 
are in very close agreement with the experi- 
mental results of correlating the number of 
atypically true answers with the scales of 
the MMPI. The agreement of the predicted 
and obtained correlations for the number of 


Table 2 
Communalities and Reliabilities of MMPI Scales 


Communalities Reliabilities 

Wheeler, Holzberg 

Little, & Rosen & Alessi 
Scale Lehner (6) (4) (3) 
L .234 62 85 
FP .700 81 .93 
K 643 65 
Hs 873 85 67 
D 741 80 80 
Hy 771 88 87 
Pd 551 88 52 
Mf 338 .64 .76 
Pa .780 75 .78 
Pt 944 80 72 
Sc 878 83 89 
Ma A494 .56 


atypically false answers is not so close, nor 
yet does it seem so divergent as to refute the 
hypothesis. Unfortunately there does not seem 
to be any significance test which can be ap- 
plied to the kind of data presented. The mean- 
ing of the results cannot be evaluated, there- 
fore, with the precision which is desirable. 


Discussion 

The correlations between the number of 
items atypically answered true and the scales 
of the MMPI are cogently similar to the cor- 
relations predicted for the hypothetical test, 
despite the lack of the significance test. There 
would appear to be little risk involved in ac- 
cepting this measure as a “pure factor test” 
of the “psychotic factor.” Although the pre- 
dicted correlations for the number of atypi- 
cal false answers do not have such a striking 
similarity, it is clear that, if included in the 
test matrix, this measure would have a heavy 
loading on the “neurotic factor.” For this rea- 
son scores based on these response sets should, 
in themselves, be of considerable interest. It 
should be remarked, however, that a measure 
of the “psychotic factor” cannot be accepted 
as a measure of psychotic tendencies without 
further experimental evidence. A similar and 
more emphatic caution applies to the “neu- 
rotic factor,” and the tendency atypically to 
answer false. 

In addition to their interest as measures, it 
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would seem that they might have another 
value. A comparison of the communalities re- 
ported by Wheeler, Little, and Lehner (6) 
with reliability coefficients reported by Rosen 
(4) and by Holzberg and Alessi (3) suggests 
that a degree of specificity remains in some 
of the scales. This comparison appears in 
Table 2. Since these two factors accounted 
for most of the communality of the scales of 
the MMPI and, since the two response sets 
appear to be relatively pure measures of the 
factors, it would appear that some adjust- 
ment of the scoring should be possible to 
reveal the remaining specificity. Such an 
adjustment should considerably increase the 
diagnostic precision of the MMPI. It is not 
clear to the writer exactly how this adjust- 
ment could be accomplished, but it seems in- 
escapable that some allowance for response 
set will be necessary in the interpretation of 
the MMPI. 

It is possible that the method of analysis 
used in this study may recommend itself to 
other investigators. The technique may be of 
value to the test constructor seeking to de- 
velop “factor pure” tests as a check of sorts 
on his success. Further, it might be used as 
a method of selecting measures to be included 
in a matrix for factor analysis. It would seem 
particularly appropriate if a previous analy- 
sis has given suggestive but somewhat equivo- 
cal results. 


Summary 


Two major factors, a “neurotic” factor and 
a “psychotic” factor, were identified by 
Wheeler, Little, and Lehner (6) in a factor 


analysis of the scales of the MMPI. Hypo- 
thetical tests collinear with these factors were 
postulated and the correlations of these hypo- 
thetical tests were computed for each of the 
scales in the correlation matrix for which fac- 
tor loadings were reported. These correla- 
tions were compared with those obtained by 
correlating the number of atypical true an- 
swers to the MMPI with the scales of the 
MMPI and those obtained by correlating the 
number of atypical false answers with the 
scales of the MMPI. It was concluded that 
the number of atypical true answers is a 
“pure factor test” of the “psychotic factor” 
and that the number of atypical false answers 
has a heavy loading on the “neurotic factor.” 
It was also concluded that some allowance 
should be made in the interpretation of the 
MMPI to allow for the influence of response 
set. 


Received February 6, 1956. 
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The Personality of Judges and Their Agreement with 
Experts in Judging Clinical Case Histories 


A. W. Bendig* 


University of Pittsburgh 


Large individual differences among judges 
in assessing clinical case histories suggest that 
variables such as intellectual ability, educa- 
tional level, interests, and personality traits 
may influence their judgments. Such factors 
may be more important than the formal char- 
acteristics of the judgmental situation: the 
type of clinical material judged or the method 
of recording the judgments. The present study 
was designed as a preliminary investigation of 
the possible relation between certain objective 
personality traits and judgmental ability. 

Undergraduate college Ss (VN = 96) ranked 
for global adjustment level 10 short clinical 
case histories with approximately half of the 
Ss ranking a set of 10 cases of adult clinic 
patients and the other half ranking 10 case 
histories of disturbed children. The Ss also 
completed the 15-scale Edwards Personal 
Preference Schedule (1). The rankings of 
each S were correlated with similar rankings 
of the same cases by 6 psychiatrists (Adult 
cases) or 7 clinical child psychologists (child 
cases) and the Ss dichotomized on the basis 
of the magnitude of their agreement with 
these experts. The Ss were further divided 
into two replicated groups and the high and 
low agreement Ss of Group I (NV = 56) com- 


1An extended report of this study may be ob- 
tained without charge from A. W. Bendig, Dept. of 
Psychology, University of Pittsburgh, Pittsburgh 13, 
Pa., or for a fee from the American Documentation 
Institute. Order Document No. 5039 from ADI Aux- 
iliary Publications Project, Photoduplication Service, 
Library of Congress, Washington 25, D. C., remit- 
ting in advance $1.25 for microfilm or $1.25 for 
photocopies. Make checks payable to Chief, Photo- 
duplication Service, Library of Congress. 


pared on each of the 15 scales of the PPS 
by two criterion (case type and agreement 
group) analysis-of-variance procedures. Two 
of the PPS scales, Achievement and Exhibi- 
tion, significantly (.01 level or better) dis- 
criminated between the high and low agree- 
ment groups, while the interaction of case 
type and agreement group was significant 
(.05 level) with the Order scale. The high 
agreement group was higher on both the 
Achievement and Exhibition scales, while the 
high agreement Ss ranking the adult cases 
and the low agreement Ss ranking the child 
cases scored higher on the Order scale. Simi- 
lar analyses of variance were computed on 
these three scales for the replication Group II 
(V = 40) and none of the resulting F ratios 
approached significance. Again the high agree- 
ment Ss showed high mean scores on the 
Achievement and Exhibition scales, but the 
differences were not significant. 

The obvious conclusion of this study is that 
there are no consistent relationships between 
the scores of naive Ss on the scales of Ed- 
wards’ PPS and their agreement with experts 
in judging clinical case histories. However, 
this does not exclude the possibility that other 
personality measures might show some corre- 
lation with judgmental ability. 

Brief Report. 
Received July 6, 1956. 
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Comparison of Two Methods of Electroshock in 
Their Effect on Cognitive Functions’ 


Barbara S. Kendall, Warren B. Mills, 


Washington University School of Medicine 


and Thomas Thale 
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Electroconvulsive therapy (ECT) commonly 
produces temporary confusion and disturb- 
ance of attention and of memory in the 
treated patient (6), or, as Stone (10) points 
out, a general impairment of cognitive func- 
tions. These unfortunate effects may accumu- 
late during the treatment period, but tend to 
disappear within two or three weeks after the 
final shock. It has usually been assumed that 
ECT by the Reiter apparatus, because a 
lesser amount of current is necessary to pro- 
duce a seizure, results in less disturbance to 
cognitive functions than does the conven- 
tional method (1, 2, 12, 13). Yet there have 
been no well-controlled studies dealing with 
measured differences in the effect on “test 
ability” of these two methods. 

The aim of the present study is to test the 
following hypotheses: 


1. For the same therapeutic result ECT by 
the Reiter method causes a lesser amount of 
impairment of cognitive functions, demon- 
strable at the end of the shock series, than 
does ECT by the conventional method. 

2. Patients shocked by the Reiter method 
show less residual cognitive disturbance two 
weeks after the final convulsion than do pa- 
tients treated by the conventional method. 

3. Two weeks after the last convulsion, 
score gains in cognitive tasks over preshock 


1The authors acknowledge their indebtedness to 
Dr. Goldine C. Gleser for help in planning the sta- 
tistical treatment of the data and to Dr. Laverne C. 
Johnson for reading the manuscript and for mak- 
ing valuable suggestions. This research was done at 
the Malcolm A. Bliss Psychopathic Hospital. 


performance are greater and losses are less 
for patients treated by the Reiter method 
than for those treated by the conventional 
method. 


Method 


Amount and type of treatment. Two groups 
of patients treated at the Malcolm A. Bliss 
Hospital are compared with reference to cog- 
nitive changes induced by ECT; one group, 
in which all were treated with the Reiter type 
46 E machine (which delivers pulsating “uni- 
directional” current), will be referred to as 
the “Reiter group”; the “Conventional group” 
consists of those receiving treatment with the 
Lektra machine model 302 (which puts out 
a single burst of unmodulated 60-cycle sine 
wave alternating current). In order to insure 
uniformity, all patients were treated with a 
bitemporal placement of the electrodes, and no 
patient was presedated. This procedure was 
the usual one followed by this hospital in ad- 
ministering ECT. Treatments were scheduled 
three times a week until a full course had 
been run. Since it had been the experience in 
this institution that 20 convulsive reactions 
on the Reiter machine were usually required 
to induce approximately the same therapeutic 
result as that induced by 12 convulsions on 
the conventional type machine, each patient 
in the Reiter group received 20 and each one 
in the Conventional group 12 treatments. In 
a few instances, because of mechanical and 
other difficulties, single interruptions of the 
treatment rhythm resulting in intervals of 
four to seven days (instead of the expected 
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two to three days) were unavoidable. These 
occurred only once for any one patient, in- 
volved four individuals in the Conventional 
and nine in the Reiter greup, and tended to 
be longer in the latter group. Later it will be 
shown that these single interruptions did not 
alleviate the detrimental effect of ECT. One 
patient in the Reiter group received her six- 
teenth treatment on a Medcraft (conven- 
tional) machine. 

Sampling. The two groups of 20 patients 
each were matched for age, education, and 
intelligence. The mean age of the Reiter 
group was 30.40 years with a range of 21 to 
52 years and an SD of 9.15; the mean age 
of the Conventional group was 34.35 years 
(range 20 to 59 years and SD of 9.59). For 
the Reiter group the mean number of years 
of education was 9.70 with a range of 6.50 
to 15.00 and an SD of 2.51; for the Con- 
ventional group, it was 10.08 with a range 
of 4.00 to 16.50 and an SD of 2.89. The 
Wechsler-Bellevue Vocabulary was used to 
obtain a stable measure which would approxi- 
mate original intellectual endowment. Form I 
was given before treatment and Form II two 
weeks after the close of treatment; which- 
ever rating was higher was used for this score. 
The mean weighted score of the Reiter group 
was 10.70 points with a range of 7 to 16 and 
an SD of 2.41; that for the Conventional 
group was 10.75 points, with a range of 4 to 
16 and an SD of 2.79. 


There were 9 men and 11 women in the Reiter 
group and 8 men and 12 women in the Conven- 
tional group. Psychiatric diagnoses for the Reiter 
group were distributed as follows: schizophrenic re- 
action, 15 (of which 4 were of schizo-affective type) ; 
paranoid state, 2; involutional psychotic reaction, 2; 
and manic-depressive reaction, depressive type, 1. 
For the Conventional group they were: schizo- 
phrenic reaction, 16 (of which 2 were of schizo- 
affective type); paranoid state, 1; involutional 
psychotic reaction, 1; manic-depressive reaction, de- 
pressive type, 1; and depressive reaction, 1. None of 
the patients had received any type of convulsive 
treatment within the previous year; no case with 
known brain damage was included. 

It was not possible to match the two groups for 
severity of illness, and selection from this point of 
view was necessarily at random. However, certain 
rough checks may be applied. Duration of illness is 
not significantly different in the two groups: 9 pa- 
tients in the Reiter and 8 in the Conventional group 
had been ill for one year or less; comparable figures 


when the criterion is two years are 12 and 10. An 
index of Wechsler-Bellevue scatter which Pascal and 
Zeaman (8) have shown to differentiate between de- 
grees of severity of psychiatric illness yields means 
of 9.35 for the Reiter and 8.06 for the Conventional 
group; the difference is not significant (¢ equals 
1.01). 


Psychological tests. In order to measure 
the amount of postshock confusion two short 
test batteries were set up, to be administered 
on the day of the first treatment, on the day 
when half the course of electroshock was com- 
pleted (in the case of the Reiter group on the 
tenth treatment day and for the Conventional 
group on the sixth day), and, finally, on the 
last day of treatment. 


The first battery, consisting of short tests of Pic- 
ture Absurdities (7), Substitution (7), Cancellation 
(Whipple), and Cube Imitation (Knox), was invari- 
ably given 30 minutes after the convulsion, and will 
be referred to as the Morning Battery. The second, 
used (with few exceptions) three to three and a 
half hours after the convulsion, will be called the 
Noon Battery; this included naming the months in 
reverse, subtracting serial 7’s (9), and serial 3’s, 
naming colors (9), duplicating from memory pat- 
terns of colored blocks (9), and identifying from a 
pack of 15 cards 5 which carried words just previ- 
ously used as stimulus words in an opposites test 
(5). Two patients in the Conventional group, one 
an aggressive chronic paranoid and the other a 
catatonic in her first episode, refused to cooperate 
in the Noon Battery on the first shock day, but 
gave adequate cooperation three hours after the 
second treatment. Where possible, alternate forms of 
the tests of these batteries were arranged for use at 
the second and third sessions. No change seemed 
practicable in Months Reversed, Picture Absurdi- 
ties, Cube Imitation, Color Naming, or the Ruesch 
Blocks; but slight changes were made by shifting 
the key in Substitution, crossing out a different let- 
ter in Cancellation, and beginning with a different 
number in Serial 7’s and Serial 3’s; for the Recog- 
nition test the subject was asked to find opposites 
for a new set of words. Thus, although transfer of 
training could not be eliminated, the influence of 
previous practice was minimized for half of the 
tasks without introducing, except in the case of 
Recognition, a real uncertainty about equivalence of 
difficulty. 


In order to have a measure of the rate of 
recovery from the temporary intellectual in- 
efficiency which is the immediate cumulative 
effect of ECT, the Wechsler Memory Scale 
Form I was administered to all patients im- 
mediately after the Noon Battery on the final 
shock day, and this was followed approxi- 
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mately two weeks later with Form II of the 
same scale. Longer-term losses and gains in 
intellectual efficiency which resulted from 
the treatment were evaluated by means of 
the Wechsler-Bellevue Scale. When possible, 
Form I was given before the first shock, and 
Form II 13 to 15 days after the last shock. 


Results 
Measures of Postshock Confusion 


In the analysis of results T scores were 
computed, according to the method of Mc- 
Call, for each subtest of the Morning and 
Noon Batteries, and these were based (with 
one exception) on all three performances of 
both groups in the same or equivalent tasks. 
The Recognition test was given after the first 
and final shocks only, and, consequently, 80 
scores, instead of 120, were used in the calcu- 
lations. It would not have been useful to score 
the subtests of the battery by already estab- 
lished norms (where these existed) because 
the scores earned after ECT tended to ac- 
cumulate at the lower end of each distribu- 
tion to an extent which precluded effective 
differentiation. It was also undesirable to 
normalize for each administration separately 
because the scores would have failed to show 
the extent of the changes which occurred in 
the course of treatment. The method chosen, 
i.e., the normalizing of scores of each subtest 
for all three administrations, does not repre- 
sent an attempt to establish standards of per- 
formance, but is rather a means of providing 
a basis for comparisons between Reiter and 
Conventional groups and among the three 
administrations of the subtests. 

In Table 1 mean scores for each group in 
each subtest administered on the first day of 
treatment are compared with mean scores for 
those given on the last day of treatment, and 
a cross comparison is made of the differences 
between gains and losses shown by both 
groups. Means were computed for midcourse 
measures, but, since they add little to the 
analysis, are not included in the table. 

Changes induced by ECT. Test-retest 
changes are considered separately for each 
experimental group. Decrements significant 
at or beyond the 2% level of confidence occur 
for both groups in Substitution and Serial 7’s, 


and at or beyond the 5% level in Cancella- 
tion. For one or the other group losses sig- 
nificant at or beyond the 1% level are found 
in Picture Absurdities, Serial 3’s, and Recog- 
nition, and at the 5% level in Cube Imita- 
tion and Months Reversed. Thus 8 of the 10 
subtests are shown to be more or less sensi- 
tive to the cognitive changes induced by 
ECT. In the case of Color Naming, however, 
the effect of previous practice appears to 
cancel out the damaging effect of ECT; and 
Block Design actually shows a gain, though 
for the Reiter only, which is significant at the 
5% level. 

Test of Hypothesis I. The difference in 
gain and loss from first to final shock is ex- 
pressed as the average change for Reiter 
minus the average change for Conventional. 
A comparison of the two groups with regard 
to these score decrements or increments yields 
only one significant difference, that for Rec- 
ognition. A possible reason why this test of 
memory and alertness made the differentia- 
tion where the other tests of postshock con- 
fusion failed to do so is that transfer of 
training is more successfully minimized for 
this task than for any of the others, partly 
because a new set of stimulus words is used 
in the retest, and partly because the mid- 
course retest is omitted. It may be that for 
the other subtests the effect of previous prac- 
tice offset in some degree the effect of con- 
fusion, but in a test of new learning the 
Reiter group was better able to demonstrate 
superiority. 


Memory Scale 


Changes in first two weeks of recovery pe- 
riod. In contrast to the experience with tests 
used to measure changes taking place during 
the course of treatment, practice effects in the 
case of the Wechsler Memory Scale, which is 
used as a measure of recovery, tend to inflate 
the retest scores and make the differences 
artificially greater. In Table 2 the mean 
Memory Quotients on the last day of ECT 
are compared with results of a second ad- 
ministration two weeks later. For both groups 
the test-retest gains are significant beyond 
the 1% level of confidence. 

Further test of Hypothesis I. While the 
majority of the tests of postshock confusion 
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Table 1 


Measures of Postshock Confusion: Means, Sigmas, and Measures of Difference in 
Reiter (R) and Conventional (C) Groups (7 Scores) 


First shock (1) Final shock (3) 
Diff. 
Test Mean Sigma Mean Sigma 1-3 t Dr—-De t 

Morning battery 

Picture Abs. 
R 52.60 6.04 46.20 9.35 6.40 2.86*** 2.10 63 
Cc 52.10 7.30 4780 9.28 4.30 1.76 

Substitution 
R 54.60 11.04 45.85 9.52 8.55 3.88*** 1.25 35 
Cc 53.90 9.43 46.40 8.27 7.50 2.72** 

Cancellation 
R 52.30 9.23 47.25 9.71 5.05 2.12* © —.40 13 
Cc 53.05 9.34 47.60 9.91 5.45 275° 

Cube Imitation 
R 54.40 8.61 50.05 8.86 4.35 2.21* 85 .23 
Cc 49.40 10.45 45.90 9.33 3.50 1.13 

Noon battery 

Months Rev. 
R 53.75 10.79 48.15 11.50 5.60 2.32* 2.60 92 
Cc 50.95 9.28 47.95 8.64 3.00 2.02 

Serial 7’s 
R 54.95 10.78 48.30 9.18 6.65 “70° 1.65 84 
Cc 52.00 10.27 47.00 7.93 5.00 4.23*** 

Serial 3’s 
R 50.60 12.69 49.10 8.77 1.50 59 —2.20 79 
Cc 51.35 9.63 47.65 8.56 3.70 3.29*** 

Color Naming 
R 51.05 13.00 48.63 10.62 2.42 1.08 —.58 a7 
Cc 51.15 10.25 48.15 8.26 3.00 1.18 

Blocks 
R 46.55 10.50 52.90 9.85 —6.35 2.42* —3.75 1.15 
C 47.55 9.43 50.15 3.44 —2.60 1.35 

Recognition 
R 51.00 6.98 53.60 9.26 —2.60 1.01 —9.80 2.73 
Cc 50.70 9.37 43.50 7.10 7.20 2.86*** 


* Significant at the 5% level of confidence. 

** Significant at the 2% level of confidence. 

*** Significant at the 1% level of confidence. 
Note.—Means are rounded. In all cases higher scores indicate more favorable performance. 


fail to bring out clear-cut differences between 
the effects of the two methods of administer- 
ing ECT, results for the Wechsler Scale on 
the final shock day give striking evidence 
that the Reiter group shows less cognitive 
disturbance. Significance beyond the 1% level 
of confidence is indicated. 

A comparison of subtest scores reveals 
that the Reiter group surpasses the Conven- 
tional in several measures of new learning, 
i.e., in Logical Memory, Memory for De- 
signs, and the “hard” items of Associate 
Learning; differences of 2.73, 1.90, and 1.05 


are significant (in the first instance at the 1% 
level, and for the last two at the 5% level of 
confidence) ; ¢ is, respectively, 2.90, 2.08, and 
2.10. No other difference in subtest scores be- 
tween the two groups (except in Orientation) 
approaches statistical significance. 

Because the two groups are matched for 
intelligence, age, and education it is assumed 
that differences found at the first administra- 
tion of the Memory Scale reflect differences 
in the amount of cognitive impairment in- 
duced by the ECT series. An attempt to 
match the groups for the memory factor by 
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Table 2 


Measures of Recovery from Postshock Damage: Means and Sigmas in Reiter (R) and 
Conventional (C) Groups 


Pretreatment (I) Final shock (I) After 2 wks. (II) 
Test Mean Sigma Mean Sigma Mean Sigma 
Wechsler MQ 
R 76.95 11.36 90.60 20.28 
Cc 66.55 8.86 81.90 16.95 
Wechsler-Bellevue IQ 
R(V 20) 98.85 12.68 95.05 13.26 
C(N 16) 103.31 12.27 99.50 10.16 
C(N 17) 103.47 11.92 
C(N 19) 96.42 13.30 
Differences and their significance 
I-II t RI-—CI t RII-—CII ¢ Da-Dce 
Wechsler MQ 
R —13.65 3.47** 10.40 3.15** 8.70 1.44 —1.70 .32 
Cc —15.35 4.39** 
Wechsier-Bellevue IQ 
R 3.80 1.89 —4.62 1.11 —1.37 —0.01 O01 
Cc 3.81 2.19* 


* Significant at the 5% level of confidence. 
** Significant at the 1% level of confidence. 


Note.—The Wechsler- Bellevue data for the Conventional group are incomplete; one patient was mute and two had been given 
only four subtests prior to shock; a fourth refused the posttest Performance Scale. Differences are calculated for the varying num- 


bers of patients who gave full records in the different situations. 


using the scale as a pretest or as a posttest 
would have been illusory. The groups com- 
prised a variety of psychiatric conditions, 
and the schizophrenic patients were in differ- 
ent stages of the illness from acute to chronic. 
Any pretest, other than vocabulary, would 
have fallen far short of representing inherent 
intellectual endowment. The remission of the 
psychosis brought about by ECT should be 
maximal immediately after the conclusion of 
the series, and its effect more nearly equal 
for all patients than at any other time, but 
this effect was essentially nullified for cog- 
nitive tests by the intellectual impairment 
which all the experimental patients suffered. 
Two weeks later, when the retest was made, 
some patients had begun to lose the benefits 
accruing from the treatment while others 
maintained their gains. In either situation 
the establishment of approximate equivalence 
for the two groups in a sensitive measure was 
not possible. Matching for the more stable 
attributes of vocabulary, age, and education, 
though not ideal, was the most that could be 
achieved. 


Test of Hypothesis 11. Two weeks after 
the last treatment the difference between the 
means of the two groups in the Memory 
Scale, though not greatly diminished, is no 
longer statistically significant, and the dif- 
ference in the extent of test-retest changes, 
i.e., in mean gains or losses, is negligible. A 
wider spread of scores, shown in what is al- 
most a doubling of the standard deviations, 
reflects a variability in immediate outcome. 
Stone (11) has found the Wechsler Memory 
Scale to be “remarkably sensitive” to the 
temporary intellectual impairment associated 
with ECT. It seems also to be sensitive to 
changes which, in those cases which derive 
no lasting benefit, are associated with a re- 
activation of the psychosis for which a tempo- 
rary remission has been effected by ECT. Ten 
patients of the Reiter and 12 of the Conven- 
tional group had improved sufficiently to be 
slated for an early discharge. Among the 
remaining patients, three were signed out 
against advice, a few were discharged later, 
but others required long-term care. The mean 
gains for the “good” outcome group—20.00 
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—and for the “poor” outcome group—7.78 
—differ by 12.22 points, and this difference 
is significant at the 2% level. 

Effect of interrupted treatment. Since the 
amount of gain shown in the final administra- 
tion of the Memory Scale might be thought 
of as a rough measure of the extent of im- 
pairment of brain function which resulted 
from the full course of ECT—impairment 
which was actually reversed in the two-week 
period following treatment—an attempt is 
made to use this measure to determine 
whether the patients whose courses of treat- 
ment were interrupted had suffered less tem- 
porary detrimental effect as a consequence. 
Eleven patients in the Reiter group who had 
uninterrupted treatment were compared with 
9 in the same group for whom 4-, 5-, and 7- 
day intervals occurred once for each case. 
The mean gain for the former group was 5.45 
and for the latter 23.66; the difference is sig- 
nificant at the 2% level (¢ = 2.67), but is not 
in the anticipated direction. This difference 
is not explained by differences in intelligence 
level, or by a significant inferiority in the 
“continuous treatment” group immediately 
after the course of ECT. Although it appears 
that improvement in the first two weeks fol- 
lowing shock was somewhat related to intelli- 
gence (the product-moment correlation be- 
tween gains and Vocabulary score being .40, 
significant at the 1% level of confidence), 
differences in Vocabulary and in the preshock 
Wechsler-Bellevue IQ between patients of 
the Reiter group having interrupted treat- 
ment and those having continuous treatment 
(.64 and 8.66, respectively) are not signifi- 
cant (¢= .54 and 1.49). For the memory 
quotient earned on the final shock day the 
mean of the “interrupted treatment” group 
surpassed that of the “continuous treatment” 
group by only 3.12; the difference is not sig- 
nificant (¢ = .55). We are at a loss to ex- 
plain why a greater amount of reversible cog- 
nitive disturbance should occur to those pa- 
tients whose treatment, because of one un- 
duly long rest interval, was spread over a 
longer period of time. There is no evidence 
that these brief interruptions operated to al- 
leviate the disrupting effect of treatment. 
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Wechsler-Bellevue Scale 


Changes pre- to postshock. Also in Table 2 
are shown mean IQs and their standard devia- 
tions for two administrations of the Wechsler- 
Bellevue Scale; Form I was given before the 
start of ECT and Form II a fortnight after 
its completion. Though in individual cases 
the changes may be marked (the range be- 
ing 30 IQ points lost to 9 IQ points gained), 
for each group as a whole a slight decrement 
is found on retest, which is significant, at the 
5% level, in the case of the Conventional 
group and approaches significance for the 
Reiter group. The experience of Gerboth (3) 
and of Hayes and Schneider (4) suggests that 
with normal subjects a negligible gain results 
when Form II follows Form I. Here the ex- 
pectation would be that remitting psychosis, 
present in more than half the cases, would 
bring about an increase in scores, but, pre- 
sumably because after two weeks there is still 
residual impairment of cognitive functions, 
this effect is not found. Confirming evidence 
that intellectual loss might still be present 
on retest is brought out in one qualitative 
feature, a difficulty with word finding, which 
occurred in a few instances. In responding to 
Picture Completion two weeks after the end 
of the shock series, four patients, though 
they indicated by pointing that they under- 
stood what was missing in item 3, were un- 
able to recall the word Ainge, and five others 
recalled it only after a marked hesitation. 

Test of Hypothesis III. The difference be- 
tween the experimental groups in average 
changes (gains or losses) in Wechsler-Belle- 
vue IQ from first to final administration is 
negligible. Therefore the hypothesis that pre- 
to postshock average gains would be greater, 
and average losses less, for the Reiter than 
for the Conventional group is not sustained. 


Summary 


In order to compare the detrimental effect 
on cognitive functions of two different meth- 
ods of administering ECT, two groups of 20 
patients each, matched for age, education, 
and intelligence, were subjected to ECT, the 
first group to 20 convulsions by the Reiter 
method, and the other to 12 convulsions by 
the Conventional method. Various psycho- 


J 
© 
? 
© 
te 
4 
‘4 


Effect of Electroshock on Cognitive Functions 429 


logical measures were used before shock, dur- tion of the Wechsler Forms I and II with 
ing the course of shock, and two weeks post- —, defectives. J. clin. Psychol. 1951, 7, 
shock. “ 5. Jellinek, E. M., & Bolles, Marjorie. The effect of 
At the close of the treatment period less bromides on certain psychological test per- 
cognitive disturbance, as measured by cer- formances. J. Psychol., 1942, 14, 277-293. 
tain tests involving new learning, had oc- 6. Kalinowsky, L. B., & Hoch, P. H. Shock treat- 
curred in the Reiter group, but differences ments, psychosurgery, and other somatic pro- 


were negligible for other tasks. In the two- Sa.) Now York: 


week postshock period gains in the Wechsler 7. Kuhlmann-Anderson Intelligence Tests. (5th 
Memory Scale were approximately equal for Ed.) Minneapolis: Educational Test Bureau, 
the two groups. Comparison of pre- to post- 1942. 

shock average changes in the Wechsler-Belle- °- Pascal, G. R., & Zeaman, Jean B. A note on the 


vue Scale gave no advantage to either group. scatter. Amer. J. 


Received January 24, 1956. 9. Ruesch, J. Intellectual impairment in head in- 
juries. Amer. J. Psychiat., 1943, 100, 480-496. 
10. Stone, C. P. Losses and gains in cognitive func- 
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Group Psychotherapy with Chronic Psychotic 
Soiling Patients’ 


John E. Tucker 
VA Hospital, Albany, N. Y. 


This study was designed to test the effec- 
tiveness of group psychotherapy under con- 
trolled experimental conditions with an ob- 
jective criterion. Twenty male psychotic Ss 
without detectable organic bases for their 
soiling habits were selected from a ward of 
38 soilers. A soiler was defined as a psychotic 
patient with urinary or fecal incontinence. 

The frequency of soiling for the 20 Ss was 
measured in a 15-day pre-experimental period 
using standard tally sheets kept by trained 
ward nurses. The 20 Ss were given a brief 
test of mental alertness which gave a quanti- 
tative score. The Ss’ ages were taken from 
medical records. The Ss were paired by trial 
and error into 2 groups of 10 each so that 
the means and SDs of their ages, soiling fre- 
quencies, and mental-alertness test scores were 
not statistically different between the 2 groups 
(critical ratios for Ma: .590, .614, .870). The 
Ss were diagnosed chronic psychotics. One 
group was randomly designated the control 
group and did not have group therapy. The 
other group was the experimental group and 
was given 45 hours of group therapy. 

Therapy was oriented toward increasing 
verbal communication. The Ss were regressed 
and disorganized. Transference was fostered 


1An extended report of this study may be ob- 
tained without charge from John E. Tucker, VA 
Hospital, Albany, New York, or for a fee from the 
American Documentation Institute. Order Document 
No. 5040 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, Wash- 
ington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to Chief, Photoduplication Service, Library 
of Congress. 


by feeding the Ss sweets. No overt changes 
were noted in Ss’ psychotic behavior but they 
showed friendliness toward the therapist and 
did not resist group meetings. The Ss achieved 
some psychological group structure as shown 
by consistent choice of seats, the formation 
of submissive-dominant relationships, and 
some role-taking. 

During the 9-week experimental period the 
20 Ss stayed on their usual ward and did not 
have any change in their living routine ex- 
cept for group therapy for the 10 experi- 
mental Ss. The identities of the 10 controls 
were concealed from ward personnel. The 
tally of soiling was kept over the 9-week pe- 
riod for both groups. At the end of the pe- 
riod the pre-experimental soiling data were 
compared with postexperimental and mean 
gain differences were analyzed. Controls de- 
creased 1.1 in their daily soiling average. Ex- 
perimentals decreased 6.7 in theirs. The 5.6 
difference favoring the experimental group 
mean gain was statistically significant, p less 
than .05. 

The selection and matching procedures em- 
bodied in the design encourage the hypothe- 
sis that the greater reduction in soiling be- 
havior in the experimental Ss resulted from 
group therapy. Possibly, group therapy was 
a sufficiently potent technique to modify 
early learned behavior patterns such as toilet 
habits. Use of group therapy with other be- 
havior pattern disturbances found in psychi- 
atric hospitals such as assaultiveness or elope- 
ment suggests itself. 


Brief Report. 
Received July 24, 1956. 
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The Short-Term Effects of Chlorpromazine on 
Psychiatric Patients 


William D. Winter and Wilbur K. Frederickson 


Hospital for Mental Diseases, Milwaukee, Wisconsin * 


The increasing use of chlorpromazine and 
other tranquilizing drugs in the treatment of 
mental disorders poses practical and theo- 
retical problems for the clinical psychologist 
working in this area. Experience with these 
drugs has demonstrated their efficacy in mak- 
ing disturbed patients more manageable and 
reducing the need for restraints, sedative 
packs, and seclusion (5). Used in conjunc- 
tion with psychotherapy, the drugs have re- 
placed to a large extent the more drastic 
“shock” therapies as the treatment of choice. 
The exact physiological and psychological ac- 
tion produced by these drugs is still largely 
unknown. We see clinically a reduction of 
anxiety, agitation, and emotional lability 
in the patient which makes him more ame- 
nable to psychotherapy. It is still undeter- 
mined whether these reactions indicate that 
basic personality changes are being effected 
in an individual or whether the drugs only 
enabie the patient to work out his problems 
more effectively by lessening the confusion 
and intense emotionality set off by unbear- 
able conflicts. 

One purpose of the present study will be a 
preliminary investigation into the nature of 
personality changes produced under chlor- 
promazine, as measured by ward behavior 
and psychological tests. Our first hypothesis 
is that within a week’s time no significant 
personality trait changes, as measured by the 
test used in this study (MMPI), would oc- 
cur as a direct result of chlorpromazine treat- 


1The authors wish to express their appreciation 
of the assistance given them in carrying out this 
study by Christine Ivanov, M.D., Donald Schaefer, 
M.D., Robert Lichtenstein, M.D., and the nursing 
staff of this hospital. 
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ment. We, however, predict an improvement 
in overt ward behavior, related to a decrease 
in subjective anxiety. 

Another general problem investigated, a 
practical one for psychologists, concerns the 
validity of administering tests involving 
visual-motor coordination to patients under- 
going chlorpromazine treatment. In this hos- 
pital most patients are routinely put on some 
form of drug therapy a few days after their 
admission. Physical symptoms due to the 
physiological action of the drugs occur shortly 
after the patients begin this therapy. These 
symptoms usually include blurred vision, diffi- 
culty in coordination, and dizziness. A test 
involving visual-motor coordination, such as 
the Bender-Gestalt test, can be expected to 
be particularly vulnerable to misinterpreta- 
tion if the side reactions to the drugs signifi- 
cantly affect test performance. Our testing 
experience with patients undergoing drug 
therapy led us to predict that tests involving 
a visual-motor task would be contaminated 
by the physical side effects of the drug. 


Methodology 


The subjects used in this study were 59 
female patients newly admitted to the Mil- 
waukee County Hospital for Mental Dis- 
eases. These patients represented a variety 
of diagnostic categories, but all those with 
known or suspected organic impairment or 
those in need of emergency care were ex- 
cluded from the sample. The age range of the 
subjects was from 18 to 45 years, with a 
median age of 33.5. The median educational 
level was 11.5 years of schooling, and the 
median mental age was 15.5, as estimated 
from the Shipley-Hartford scale. 
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The test battery included the Bender- 
Gestalt test, the group form of the MMPI, 
and the Shipley-Hartford scale. The Gardner 
Behavior Chart (8) was used to estimate ad- 
justment to ward routine. A check list show- 
ing the deviation of the patient’s drawings 
from the stimulus figures was used to score 
the Bender-Gestalt test. This check list was 
developed by the authors at the suggestion 
of Max Hutt. This list incorporates many of 
the items in Pascal and Suttell’s scoring sys- 
tem (6), and also includes deviations such as 
constriction and overexpansiveness, which are 
found in Hutt’s criteria (4). Unlike Pascal’s 
system, differential weighting of specific devi- 
ations was not attempted. Scoring reliability 
for this check list was found to be .91. The 
scoring of the MMPI followed the practice 
of adding the designated percentage of K to 
the necessary diagnostic scales (3). 

As patients were admitted to the hospital, 
they were randomly assigned to two groups. 
Both groups received placebos, which were 
identical to the chlorpromazine tablets in 
size and color, for three days. At the end of 
this time, each patient was interviewed and 
tested with the Bender-Gestalt test and the 
MMPI, and a nurse who was familiar with 
the patient filled out the Gardner Rating 
Chart on her. On the fourth day, the pa- 
tients in the Drug group began receiving 50 
mg. tablets of chlorpromazine (thorazine) 4 
times a day, and the Placebo group con- 


tinued getting placebos. Seven days later, 
both groups were retested on the Bender and 
MMPI and another Gardner Rating Chart 
was filled out. The Shipley-Hartford scale 
was also administered at this time. At no 
time did the physicians, nurses, or psycholo- 
gist doing the testing know which of the two 
coded treatment routines involved the use of 
chlorpromazine. The Bender records were also 
coded so that the scorers would not know 
whose test was being scored or whether the 
protocol was from the first or second test 
administration. 

The two groups were not significantly dif- 
ferent on the variables of age, education, in- 
telligence, number of admissions to the hos- 
pital, or diagnostic category. 


Results 


In Table 1 are listed the test-retest reli- 
ability coefficients for each of our measures 
with the time interval of one week between 
tests. These correlations are based on the 
scores of the drug and placebo groups com- 
bined, and are similar to the average reli- 
abilities found in previous studies (1, 3, 7). 
The median reliability for the placebo group 
alone was .75; for the drug group, .745. 

The changes occurring in the test scores 
over a one-week period are given in Table 2 


Table 2 


Significance of Changes Occurring in a One-Week 
Period for Entire Sample 


Table 1 
Test-Retest Reliability Coefficients Mean 
score 
onfirst Mean 
Measure N r Measure N testing change p 

Ward Ratings 51 86 Ward Ratings 51 36.8 +3.1 <.01 
Bender-Gestalt 44 .80 Bender-Gestalt 44 27.5 —2.6 <.01 
MMPIT Scores MMPIT Scores 

L 31 78 L 31 57.2 +1.7 <.20 

F 31 76 F 31 60.1 —1.9 <.20 

K 31 65 K 31 59.4 +2.5 <.10 

Hs 31 76 tr. 31 58.6 —1.2 <.40 

D 31 79 D 31 68.2 —3.2 <.20 

Hy 31 66 Hy 31 65.2 —2.8 <.20 

Pd 31 71 Pd 31 72.0 —4.7 <.02 

Mf 31 63 Mf 31 54.7 —1.5 <.30 

Pa 31 75 Pa 31 59.0 —1.9 <.30 

Pi 31 86 Pt 31 63.0 —3.9 <.02 

Se 31 77 Se 31 65.2 —41 <.02 

Ma 31 65 Ma 31 55.3 —2.7 <.20 
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for the entire sample. Change was computed 
by subtracting the first test score from the 
second test score of each subject, and the 
significance of this difference was determined 
by the use of two-tailed ¢ tests for correlated 
data. As can be seen from the table, our sub- 
jects as a group showed significant improve- 
ment in ward behavior and Bender-Gestalt 
scores, and a significant decrease on the 
Psychopathic Deviate, Psychasthenia, and 
Schizophrenia scales of the MMPI. They also 
showed a tendency toward a higher K score 
on the MMPI at the time of their second 
testing. 

Inspection of the data revealed that the 
two groups differed significantly on several 
MMPI scales on the original testing, there- 
fore, the drug and placebo patients were 
matched as closely as possible on the basis 
of their initial scores on each measure, in 
order to determine which group changed 
more in one week’s time. Comparison of the 
mean changes of the two groups by use of 
the ¢ test for uncorrelated means revealed 
only three variables in which the differences 
approached significance at between the .10 
and .05 level of confidence. The patients re- 
ceiving drugs improved more than those tak- 
ing placebos on the Ward Ratings, and the 
placebo group improved more on the Bender. 
The drug group showed significantly higher 
scores on the K scale of the MMPI. The 
number of subjects compared on Ward Rat- 
ings was 42, on the Bender-Gestalt 32, and 
on the K scale 22. 


Discussion 

Analysis of the data indicates that in a 
period of one week the patients as a group 
showed small but statistically significant 
changes in Ward Behavior ratings, Bender- 
Gestalt scores, and on several MMPI scales. 
This is congruent with our own clinical ex- 
perience with acutely disturbed patients, and 
with the result of other studies (7); the 
severity of their symptoms often decreases 
markedly after only a few days of hospitali- 
zation. The problem of the depth and per- 
manence of these changes cannot be com- 
pletely solved. It should be pointed out that 
our data are limited and we cannot precisely 
evaluate the effects of practice, the lack of 


independence of the MMPI scales, the short 
time interval, and the use of only a moderate 
drug dosage. 

When we compare the mean changes of the 
drug group with those of the placebo group, 
we do not find any outstanding differences. 
There is a trend, however, for the drug group 
to show better ward adjustment, poorer co- 
ordination on the Bender-Gestalt, and a 
higher K score. The first two findings are in 
the predicted direction. The use of chlorprom- 
azine reduces subjective anxiety and exces- 
sive emotional lability, which makes the pa- 
tient easier to get along with and more 
approachable. At the same time, particularly 
in the early phases of drug therapy, the pa- 
tient suffers from many side reactions such 
as dizziness, blurred eyesight, poor coordi- 
nation, and physical unrest, all of which 
could easily affect performance on the Bender- 
Gestalt test. 

Apparently none of the usual diagnostic 
scales on the MMPI were differentially af- 
fected by the drug. The test-retest reliability 
figures for the two groups on the MMPI are 
similar to those cited in the MMPI manual 
and in other studies (3, 7). This suggests 
that the use of chlorpromazine therapy with 
the patients resulted mainly in superficial 
behavioral improvement which may have in- 
volved a suppression of symptoms rather than 
deep personality changes. There could, of 
course, be an increase in ego strength which 
may lead to eventual major changes in per- 
sonality structure, but the data shed no 
light on the question. It is interesting that 
other studies in this area have shown little 
or no change in psychological tests after drug 
therapy despite such overt changes in be- 
havior as to markedly lessen the need for 
sedative packs, electric shock, seclusion, etc. 
(2, 5). 

A comment should be made concerning the 
need for using control groups when doing re- 
search in this area. As mentioned above, a 
certain percentage of acutely ill patients, 
even when not given any special kind of 
treatment except for routine hospitalization, 
tend to get better. When control groups can- 
not be used, it is easy to attribute this im- 
provement to whatever therapy is being ad- 
ministered. In one case a physician hesitated 
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about putting a patient on the project be- 
cause she was quite disturbed and he wanted 
her to receive electric shock. She was never- 
theless placed on the project where she made 
spectacular improvement in her ward behav- 
ior. The physician was reassured that the pa- 
tient was at least getting some drug therapy, 
but a later check of the records showed that 
the woman was in the placebo group. Thus 
the physician was falsely attributing her spon- 
taneous (i.e., of unknown cause) recovery to 
the action of the drug. 

In interviews with patients we were made 
aware of the effect of group suggestion. Many 
patients entered the hospital after reading 
optimistic newspaper reports of the “wonder 
drugs,” and every patient was exposed to the 
enthusiasm of the medical staff and to the 
comments of other patients. Sometimes a pa- 
tient in the placebo group would report not 
only a lessening of tension due to the sup- 
posed drug she was receiving, but would also 
dwell on the physical discomfort the medi- 
cation caused. 

Improvement in both groups varied con- 
siderably from individual to individual. For 
example, the change in ward behavior rating 
on the placebo group ranged from — 12 to 
— 15, and for the drug group from — 4 to 
— 14. 

The drug, then, can be said to cause some 
apparent changes for the better in a seven- 
day period, but it is not possible to say how 
central these changes are or how deep they 
go. Individual differences are great, and the 
therapeutic effects may sometimes be exag- 
gerated in uncontrolled research by neglect- 
ing the factors of spontaneous improvement, 
suggestion, and biased observation. 


Summary 
A group of acutely disturbed female pa- 
tients was tested before and after a week of 
chlorpromazine therapy, and the changes in 
test scores were compared with those of a 
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control group which received only placebos. 
The combined groups showed small but sig- 
nificant changes in Ward Ratings, Bender- 
Gestalt test scores, and on the Pt, Pd, Sc, 
and K scales of the MMPI. When patients 
were matched on their original score for each 
measure, the data showed a trend of the drug 
group to improve more in Ward Ratings and 
increase more on K, but to do less well on 
the Bender-Gestalt. Our conclusions are that, 
in general, acutely disturbed patients tend to 
improve after a few days of hospitalization 
and that chlorpromazine speeds behavioral 
adjustment to ward routine, but that it causes 
some disruption of coordination, especially 
during the so-called “turmoil phase” at the 
beginning of treatment. Evaluation of psy- 
chological tests involving visual-motor co- 
ordination, such as the Bender-Gestalt, 
should, therefore, take into account the ef- 
fects of concurrent drug therapy. 


Received April 9, 1956. 
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In the Proverbs Test (10), the subject is 
asked to tell the meaning of a proverb. For 
normal subjects, even as young as fifth 
graders, this request readily stimulates what 
Goldstein refers to as the “abstract attitude.” 
Indeed, the process of translating the con- 
crete symbols of a proverb into generalized 
concepts is almost explicitly described in one 
of Goldstein’s characterizations of abstrac- 
tion: 


In the abstract attitude, we transgress the im- 
mediately given specific aspect or sense impression; 
we abstract from particular properties. We detach 
ourselves from the given impression, and the indi- 
vidual thing represents to us an accidental sample 
or representative of a category (7, pp. 18-19). 


Benjamin (3) believed that the breakdown 
of verbal abstraction so commonly present in 
schizophrenia could be investigated by prov- 
erbs. Goldstein (8) claimed that the findings 
of the test for abstraction and concreteness 
which he and his associates had developed 
for investigating organic damage had rele- 
vance for research with schizophrenia and 
pointed out the need for more measures of 
abstraction. 

The clinical Proverbs Test consists of three 
parallel forms containing 12 proverbs each, 
scaled in the order of difficulty. Samples are: 


. Rome was not built in a day. 

. When the cat’s away the mice will play. 
. The wife is the key to the house. 

. Speech is the picture of the mind. 

. Better be happy than wise. 

. The more cost, the more honor. 


aunt 


1A paper presented at the annual meeting of 
the Southwestern Psychological Association, Dallas, 
Texas, March 23, 1956. 
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An abstract (Ad) score on this form of the 
test is secured by rating the items according 
to sample scores for each proverb. 

The multiple-choice Proverbs Test consists 
of 40 items with four answers each. It is 
scored by stencil. An abstract (Ad) score 
consists of the total number of correct re- 
sponses. A concrete (Co) score consists of 
the number of items marked from 18 “con- 
crete” answers. Samples of an easy and a 
difficult item are: 


1. The sun shines upon all alike. 
a. It’s the same sun everywhere. 
b. All are created equal. (Ab answer) 
c. The sun shines on everybody. (Co answer) 
d. People that do the same things are alike. 
2. The good is the enemy of the best. 
a. Get the best of your enemy whenever you 
can. 
b. It is best to be good even if you fail. 
c. We should love our enemies. 
d. Most people are too easily satisfied. (Ab an- 
swer) 


The Proverbs Test seems to possess use- 
ful qualities as an instrument for measuring 
impairment of conceptualization in schizo- 
phrenia. It gives a relatively pure measure of 
“verbal Comprehension” as evidenced by 
high correlations with tests heavily loaded 
with this factor (9, p. 7). 

These considerations led the author to ex- 
plore the possibilities of utilizing the Prov- 
erbs Test to investigate impairment in ab- 
straction in schizophrenia and to differentiate 
schizophrenics from normals. Reliability stud- 
ies indicated that the test was satisfactory 
when the subjects were hospitalized schizo- 
phrenic patients (10). As samples of schizo- 
phrenic populations were gathered, it be- 
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came apparent that their Proverbs Test 
scores were generally lower than those of 
normals. Since test scores are so closely cor- 
related with intelligence, however, this factor 
had to be controlled before it could be de- 
termined whether an actual difference existed. 

A pilot study was made in which 78 nor- 
mal subjects were compared with 118 chronic 
schizophrenics. Analysis of covariance was 
used to hold intelligence constant and the 
Wechsler vocabulary subtest was used as the 
measure of intelligence. The analysis yielded 
an F of 14.30 which is significant at the .001 
level of confidence. This finding seemed to 
justify a rigid test of the variation of scores 
on the Proverbs Test between schizophrenics 
and normals. 


Method 


From a sample of 332 Air Force basic air- 
men and 232 hospitalized schizophrenic vet- 
erans, 100 AF men were matched for sex, 
education, and intelligence with 100 chronic 
schizophrenics. The Word Knowledge Test 
of the Airman Classification Battery was 
used as the measure of intelligence. The 
pairs were matched exactly for stanine scores 
on the Word Knowledge Test. They were 
matched within two years for level of edu- 
cation; about two thirds matching exactly. 
The findings of this study are presented in 
Table 1. 

The abstract (Ad) score and the concrete 
(Co) score on the multiple-choice Proverbs 
Test are secured by the use of scoring sten- 


cils. Comparable scores for the individual 
(clinical) form of the test are secured by 
individual scoring according to scoring sam- 
ples for each proverb (9, 10). The table 
shows that the differences in the means of 
both the abstract and the concrete scores for 
both forms of the test are significant beyond 
the .001 level of confidence. 

Since intelligence has been held constant 
by matching each normal with each patient 
on the Word Knowledge Test, the striking 
drop from normal to schizophrenic group 
on the abstract scores and the corresponding 
rise in the concrete scores indicate an impair- 
ment in abstraction resulting from the schizo- 
phrenic condition of the patients. In this 
connection it is interesting to note the nega- 
tive correlation between the abstract and 
concrete scores on the Proverbs Test where 
independently derived. The coefficients of 
correlation between the Co of the multiple- 
choice test and the Ad score of the clinical 
test were found to be — .58 for 100 normal 
subjects and — .54 for 173 chronic schizo- 
phrenic subjects. 


Cutting Points and Their Application 


In order to establish cutting points and 
norms for use with the Proverbs Test the 
total available population of normals and 
schizophrenics was divided into intelligence 
groupings according to scores on the Word 
Knowledge Test. Such treatment of the data 
was necessitated by the correlation between 


Table 1 


Validity of the Proverbs Test for Discriminating Between Normal and Schizophrenic Men with 
Education and Intelligence Held Constant 


(100 matched pairs: V = 200) 


Air Force Chronic 
enlistees schizophrenics Mean 
Test and score M SD M SD D t p 
Multiple-Choice Test 
Ab (Abstract) score 24.6 7.4 16.7 8.7 7.9 10.34 <.001 
Co (Concrete) score 1.4 2.3 4.8 3.5 3.4 9.50 <«.001 
Clinical Test (Form I) 
Ab (Abstract) score 11.5 6.3 5.2 6.5 6.3 34.45 «.001 
Co (Concrete) score 1.8 2.6 4.7 3.5 2.9 32.22 «.001 
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Table 2 


A Comparison of Scores of Normals and Schizophrenics 
Means and SDs by Intelligence Groupings * 


Ab (abstract) score 


Co (concrete) score 


Test and score Mean SD N Mean SD 
Multiple-choice test 
High Intelligence (W.K.: 7, 8, 9) 
Normals 31.3 3.9 58 0.3 0.7 
Schizophrenics 24.8 7.3 65 2.8 2.9 
Middle Intelligence (W.K.: 4, 5, 6) 
Normals 22.8 5.4 136 1.6 2.4 
Schizophrenics 15.3 6.5 79 §.0 3.6 
Low Intelligence (W.K.: 1, 2, 3) 
Normals 16.0 4.9 58 3.9 3.0 
Schizophrenics 9.6 3.9 65 6.1 2.9 
Clinical test 
High Intelligence (W.K.: 7, 8, 9) 
Normals 19.2 2.6 58 0.2 0.6 
Schizophrenics 12.6 6.4 65 1.9 5.9 
Middle Intelligence (W.K.: 4, 5, 6) 
Normals 11.4 5.7 136 1.1 1.6 
Schizophrenics 4.9 4.7 79 2.5 4.4 
Low Intelligence (W.K.: 1, 2, 3) 
Normals 4.0 4.0 58 3.1 2.7 
Schizophrenics 1.0 1.0 65 3.9 2.4 


Note.—The normal population consisted of Air Force basic airmen from the Personnel and Training Research Center, Lack- 


land A. F. Base. The schizophrenic population consisted of a re 


Waco Veterans Administration Center. These data are taken 
are used with the permission of the publisher. 


* Based on the Word Knowledge Test of the A. F. Airman Classification Battery. 
23%, 4-6 include the middle 54% and 1-3 include the bottom 23%. 


intellectual level and performance on the 
Proverbs Test. The norms by intelligence 
groupings are presented in tables in the clini- 
cal manual of the Proverbs Test (10). In 
these tables the mean score and SD of nor- 
mals and schizophrenics can be compared for 
each stanine score group of the Word Knowl- 
edge Test. Some of these data are presented 
in Table 2, which shows combined groupings 
of stanines 1-3, 4-6 and 7-9 on the Word 
Knowledge Test. These divisions are roughly 
equivalent to the upper quartile, middle half, 
and lower quartile in a distribution of nor- 
mal subjects. 

The data in Table 2 were used as the basis 
for establishing experimental cutting points 
for differentiating normals from schizophren- 


resentative sample of testable chronic hospitalized veterans from 
rom norms in the clinical manual of the Proverbs Test (10) and 


The W. K. scores 7-9 include the top 


ics. Actually, a considerable amount of trial 
and error entered into the process. It was 
found that at the upper and middle intelli- 
gence levels, the abstract and concrete scores 
of the multiple-choice test could be used most 
efficiently in combination. This was not true 
at the low intelligence level, where the con- 
crete score had no usable differentiating value 
(see Table 2). The effectiveness of the cut- 
ting points when applied to a population of 
352 normals and 232 chronic schizophrenics 
is shown in Table 3. The fact that a substan- 
tial portion of these cases was included in the 
population from which the cutting points 
were derived should be taken into considera- 
tion in judging the effectiveness of this ap- 
plication. 
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Table 3 
Application and Cross Validation of Cutting Points for Multiple-Choice Test 


Original sample Cross validation sample 
Schizo- Schizo- 
Group and cutoff scores Normal phrenic Total Normal phrenic Total 
High Intelligence (W.K.: 7-9) 
Ab > 26 and Co < 2 86 22 108 48 6 54 
Ab < 26 and/or Co > 2 19 43 62 9 21 30 
105 65 170 57 27 84 
Middle Intelligence (W.K.: 4-6) 
Ab> 15 and Co <4 112 23 135 24 8 32 
Ab < 15 and/or Co > 4 29 56 85 4 22 26 
141 79 220 28 30 58 
Low Intelligence (W.K.: 1-3) 
Ab > 12 70 18 88 16 10 26 
Ab < 12 16 70 86 4 22 26 
86 174 20 32 52 
Combined Groups 
Predicted Classification 
Normal 288 63 351 88 24 112 
Schizophrenic 64 169 233 17 65 82 
352 232 584 105 89 194 


The table shows that there were 64 false 
positives in the normal population sample 
and 63 false negatives in the schizophrenic 
population sample. The predictive efficiency 
of the application of the cutting points was: 
for normals, 81%, for schizophrenics, 73%, 
and for the total population of 584 cases, 
77.5%. 

In attempting to establish cutting points 
for the clinica! test, it was found that the con- 
crete scores could not be used successfully in 
combination with the abstract scores. The 
latter scores used alone, however, separated 
normals from schizophrenics with about the 
same degree of efficiency as the multiple- 
choice test scores. The application of the cut- 
ting points to the 100 matched pairs de- 
scribed earlier in this paper is shown in 
Table 4. The table shows that there were 17 
false positives in the normal group, 28 false 
negatives in the schizophrenic group or a 
total of 45 misclassifications in the total 
sample of 200 cases. The efficiency of the 
clinical test for this sample is thus 77$%. 


Cross Validation 


The cutting points for use with the multi- 
ple-choice test were applied to a new popu- 
lation of normals and schizophrenics in or- 
der to cross validate the effectiveness of the 
measures. The new normal population con- 
sisted of 105 men and women in civilian and 
civil service employment at a variety of vo- 
cational levels. The new schizophrenic popu- 
lation consisted of 51 male veterans and 38 
male and female chronic state hospital pa- 
tients. The results of the cross-validation ap- 
plication of the cutting points for the multi- 
ple-choice test are given in Table 3. 

The predictive efficiency of the cutting 
points applied to this new sample of 194 
cases was 78%—almost identical with the 
original findings. The new sample contained 
men and women in both groups, confirming 
the normative data on the Proverbs Test 
which showed no sex differences (9). The 
inclusion of state hospital patients in the new 
sample indicates that the experimental cut- 
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Table 4 
Application of Cutting Points for Individual Test * 


Group and cutoff scores Normal Schizophrenic Total 
High Intelligence (W.K.: 7-9) 
Abstract score > 14 24 8 32 
Abstract score < 14 5 21 26 
29 29 58 
Middle Intelligence (W.K.: 4-6) 
Abstract score > 8 38 14 52 
Abstract score < 8 10 34 45 
48 48 96 
Low Intelligence (W.K.: 1-3) 
Abstract score > 2 21 6 27 
Abstract score < 2 2 17 19 
23 23 46 
Combined Groups 
Predicted Classification : 
Normal 83 28 111 
Schizophrenic 17 72 89 
100 100 200 


* Population ; 100 normals (Air Force basic airmen) ; 100 schizophrenics (chronic hospitalized veterans). 


ting points have validity when applied to 
other than the male veteran schizophrenic 
patients on the basis of whose scores they 
were evolved. 

Discussion 

Several questions are raised in connection 
with the utilization of the Proverbs Test for 
differentiating schizophrenics from normals. 
The comparatively poor results of the multi- 
ple-choice test at the high intelligence level 
stand out. An examination of the actual cases 
in the schizophrenic group showed that 56% 
of the false negatives were paranoid schizo- 
phrenics, whereas only 34% of the true nega- 
tives fell in this diagnostic group. This find- 
ing points to the relative ineffectiveness of 
the multiple-choice test for screening out 
paranoid schizophrenics. 

It is recognized, of course, that the effi- 
ciency of prediction is affected by the type 
of population to which the method is ap- 
plied. A population containing roughly 50% 
normals and 50% schizophrenics does not 
actually exist. If applied to an unselected 
population containing only 1% schizophren- 


ics, the predictive efficiency would be greatly 
reduced. On the other hand, if applied to a 
clinic population consisting entirely of peo- 
ple seeking assistance for mental and emo- 
tional problems, the predictive efficiency 
might well be enhanced. 

When used with schizophrenics, the free- 
answer Clinical test differs from the multiple- 
choice test in that it elicits psychotic mate- 
rial. In addition to concreteness in thinking 
which is associated with a breakdown of the 
abstracting function, the schizophrenic fre- 
quently “escapes” by a further regression 
characterized by “originality of performance” 
(1, p. 317). This is evidenced on the clinical 
test by responses characterized by autism, 
bizarreness, irrelevance, perseveration, and 
delusional preoccupation. Because of this free 
flow of schizophrenic material which is stimu- 
lated by this form of the Proverbs Test, diag- 
nosis of schizophrenia is sometimes obvious 
upon clinical evaluation even though the ob- 
jective scores fail to properly classify the 
subject. 

It should be re-emphasized that the Word 
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Knowledge Test, which was used as the 
measure of intelligence in this investigation, 
is highly correlated with the Proverbs Test 
and gives a relatively pure measure of “ver- 
bal comprehension” (9, Table 3). Other test- 
ers will find the basic data for establishing 
intelligence divisions and cutting points suit- 
able for their own purposes in the clinical 
manual of the Proverbs Test (10). 


Summary and Conclusions 


1. In a rigid statistical test to determine 
its effectiveness for discriminating between 
normal and schizophrenic subjects, both 
multiple-choice and individual clinical forms 
of the Proverbs Test yield mean scores whose 
differences are highly significant. 

2. Cutting points for differentiating nor- 
mals from schizophrenics are presented for 
three intelligence groupings which approxi- 
mate the top quartile, middle half, and low- 
est quartile. The application of these cutting 
points correctly screens out approximately 
80% of normal subjects and 75% of schizo- 
phrenic subjects. This level of efficiency was 
maintained on a cross-validation study based 
on 194 cases. 

3. The effectiveness of the screening is con- 
sistent in the three intelligence levels for nor- 
mal subjects. For schizophrenics the screen- 
ing is least effective for the high intelligence 
grouping, probably due to the ineffectiveness 
of the technique when applied to chronic 
paranoid schizophrenics. 


Received April 3, 1956. 
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The Ability of Schizophrenics to Modify Responses 
in an Interpersonal Situation’ 


M. David Diamond 
Riverside Hospital, The Bronx, New York 


The present paper discusses the results of 
an experiment designed to test the hypothe- 
sis that schizophrenics, as compared to non- 
schizophrenic emotionally disturbed individu- 
als, are deficient in the ability to modify their 
responses in the direction of the responses of 
another person. 

Mead defined rationality as the process 
whereby the individual “can take the atti- 
tudes of the others and control his action 
by these attitudes, and control their action 
through his own” (2, p. 339). Thus the “ra- 
tional” person can modify his responses in 
the light of the responses of others. 

Along similar lines of thought, Cameron 
states that, for the normal person: 


. . . the more effectively he is able to allow the atti- 
tudes and responses of others, which he predicts in 
symbolic role-taking, to influence his own reactions, 
the more competent he ought to be in social situa- 
tions (1, p. 93). 


Specifically the hypothesis tested was that 
schizophrenic individuals modify their re- 
sponses less in accordance with the responses 
of another person when these responses are 
communicated in an interpersonal situation 
than do nonschizophrenic individuals. This 
hypothesis was tested at two levels of remote- 
ness from immediate stimulation by having 
the subjects respond, first in the presence of 
another individual, and subsequently alone. 

If an individual in an ambiguous situation 
modifies his responses in accordance with the 
responses of another individual, when he is 


1 This paper is a condensation of a part cf a doc- 
toral dissertation completed at New York Univer- 
sity. The author wishes to acknowledge the help of 
Drs. L. Crafts, D. Wechsler, M. Deutsch, I. Katz, 
and P. Stout. 
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in the presence of the other person, this 
would indicate that he can at least momen- 
tarily share the points of view of another per- 
son. If the individual modifies his responses 
in accordance with the responses of another 
individual, when the other individual is no 
longer physically present, this would indi- 
cate that his ability to accept the other per- 
son’s points of view is not merely a transi- 
tory one, and that he is able to internalize 
the points of view of another. As a rough 
analogy we might, from a psychoanalytic 
point of view, consider the first task to be a 
measure of ego controls and functioning and 
the second one of superego controls and 
functioning. 

Sherif’s work with the autokinetic phe- 
nomenon suggested a technique for demon- 
strating the degree to which one’s responses 
are modified by the responses of another, and 
the degree to which the responses of the other 
person are internalized. The autokinetic phe- 
nomenon consists of the perception of appar- 
ent movement of a stationary point of light, 
and is measured by having the subjects esti- 
mate the distance through which the point of 
light seems to move. Sherif found that when 
normal individuals are faced with the am- 
biguous autokinetic phenomenon, they form 
their own frame of reference and observe the 
point of light to move within a certain range. 
When individuals first face the autokinetic 
situation alone and then face it again in a 
group situation, they tend to change their 
estimates toward a norm and range which is 
established in the group. When subjects first 
face the autokinetic situation as members of 
a group, a norm and range are established 
which are peculiar to the group, and after- 
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wards when the subjects face the same situa- 
tion alone, they tend to adhere to the norm 
and range of the group. Sherif views the en- 
tire process as a prototype of the formation 
of a social norm, which then acts as a frame 
of reference for determining later reactions 
(3). 

The autokinetic phenomenon seemed to be 
particularly appropriate for studying the ef- 
fect of social interaction on behavior because 
it enables the investigator to isolate the vari- 
able of social influence from complex field in- 
fluences such as the effects of learning and 
experience. Thus, in the present study an 
autokinetic experiment was used as a meas- 
ure of the degree to which subjects modified 
their responses in the direction of the re- 
sponses of others. 


Subjects 


The experimental subjects were thirty hos- 
pitalized male schizophrenics. Of the thirty 
schizophrenic subjects ten had been classified 
as schizophrenic reaction, paranoid type; 
nine as schizophrenic reaction, childhood 
type; seven as schizophrenic reaction, simple 
type; and four as schizophrenic reaction, 
chronic undifferentiated type. 

A control group was sought which mani- 
fested a variety of types of psychopathol- 
ogy in order to eliminate the possibility that 
the hypothesized deficiencies in interpersonal 
skills were concomitant with emotional dis- 
turbance per se. Male drug addicts in a re- 
habilitation hospital were selected as control 
subjects since, as a group, they have been 
found to present a variety of emotional dis- 
turbances. No schizophrenic drug addicts 


Table 1 


Nonparametric Median Test for Initial Differences 
Between Pair-Members’ Estimates of Light 
Movement When Judging Alone 


Below Above 
Subjects median median Total 
Schizophrenic pairs 7 8 15 
Drug addict pairs” 8 7 15 
Total 15 15 
Chi square = .13 df=1 p> .10 
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Table 2 


Nonparametric Median Test for Differences Between 
Pair-Members’ Estimates of Light Movement 
When Judging Together 


Below Above 
Subjects median median Total 
Schizophrenic pairs 4.5 10.5 15 
Drug addict pairs 10.5 4.5 15 
Total 15 15 
Chi square = 4.8 p< 02 


df=1 


were selected. Of the thirty drug addicts, 
eight were classified as inadequate person- 
ality, eight as schizoid personality; six as 
passive-aggressive personality; and one was 
classified in each of the following eight cate- 
gories: adjustment reaction of adolescence, 
adjustment reaction of childhood, mixed 
psychoneurosis, neurotic character disorder, 
personality pattern disturbance, personality 
trait disturbance, psychoneurotic reaction, 
and sociopathic personality. 

The control subjects were selected to be 
similar to the schizophrenics with respect to 
age, highest school grade completed, intelli- 
gence, race and socioeconomic background. 
The mean ages of the schizophrenic and 
drug addict groups were 19.3 and 18.5, re- 
spectively; the mean highest school grade 
completed, 9.6 and 9.9, respectively; the 
mean IQs 94.1 and 98.8, respectively. The 
two groups did not differ significantly with 
respect to any of these three characteristics. 
Both groups consisted of about 50 per cent 
Caucasian, and 25 per cent each of Puerto 
Rican and Negro individuals, primarily of 
lower socioeconomic background. 


Procedure 


The modification of responses in accord- 
ance with the responses of others was meas- 
ured by means of an autokinetic experiment, 
in which the subject judged the apparent 
amount of movement of a stationary point of 
light in a darkened room, first when alone, 
then with another individual, and then finally 
alone again. Each subject made twenty esti- 
mates when alone. On the basis of these ini- 
tial estimates when judging alone, subjects in 
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each group were then assigned to pairs so 
that the initial differences between pair-mem- 
bers’ estimates were approximately the same 
for subjects in both groups. The subject then 
made twenty judgments when with his pair- 
member and finally twenty more judgments 
when alone again. 

The relative amount of convergence of 
pair-members’ estimates when judging to- 
gether provides a measure of the degree of 
modification of responses when in an inter- 
personal situation. The relative amount of 
convergence of estimates of pair-members 
when later judging alone provides a measure 
of the degree to which the “group norm” was 
introjected or internalized. 

For each of the three parts of the experi- 
ment—individual judgments, judgments in 
pairs and subsequent individual judgments— 
the median of the twenty estimates was de- 
termined for each subject. The difference be- 
tween the medians of pair-members’ esti- 
mates for subjects in the two groups was 
then compared by means of the nonpara- 
metric median test for each part of the ex- 
periment. The nonparametric median test is 
essentially a chi-square test. 


Results 


Table 1 presents the results of the non- 
parametric median test which tests the hy- 
pothesis that the initial difference between 
pair-members is the same for the two groups. 
The resulting chi-square value of 0.13, which 
is not statistically significant, supports the 
null hypothesis that the two groups come 
from populations having the same median 
difference between pair members with respect 
to their assignment to pairs. Thus the pairs 
in the two groups can be considered similar 
with respect to initial differences between the 
medians of their estimates, before the experi- 
mental variable was introduced. 

When judging with the pair members, sub- 
jects in both groups modified their responses 
in accordance with those of the pair members. 
However, the modifications made by the drug 
addicts were greater than those made by the 
schizophrenics. 

Table 2 presents the results of the median 
test for the differences between median esti- 


mates of pair members when judging together. 
The obtained chi-square value of 4.8 is sig- 
nificant at the .02 level. Thus, the data sup- 
port the hypothesis that schizophrenic indi- 
viduals modify their responses in accordance 
with the responses of another person less than 
do nonschizophrenic drug addicts, when re- 
sponding in the presence of the other person. 

The results obtained with the drug addicts 
are similar to those obtained by Sherif when 
he performed an autokinetic experiment with 
pairs of normal subjects. When responding in 
a group after having first responded alone, 
the median difference between Sherif’s pair- 
members’ estimates was about 0.50 of an 
inch, which is the value obtained for the drug 
addict pairs (3). 

When responding alone after having judged 
with the pair member, the drug addicts con- 
tinued to modify their responses in accord- 
ance with those of the pair member as much 
as when they were in the presence of the pair 
member. The schizophrenics, on the other 
hand, tended to deviate more from their pair- 
members’ estimates when no longer in the im- 
mediate presence of the pair member. 

Table 3 presents the results of the non- 
parametric median test for the median differ- 
ence between pair-members’ estimates when 
judging alone after judging in pairs. The ob- 
tained chi-square value of 6.5, which is sig- 
nificant at the .01 level, supports the hy- 
pothesis that, after having participated in an 
actual interaction with another person, schizo- 
phrenic individuals modify their responses 
less in accordance with the responses of the 
other person than do nonschizophrenic drug 
addicts. 


Table 3 


Nonparametric Median Test for Differences Between 
Pair-Members’ Estimates of Light Move- 
ment When Judging Alone After 
Judging Together 


Below Above 


Subjects median median Total 
Schizophrenic pairs a 11 15 
Drug addict pairs 11 4 15 
Total 15 


Chi square = 6.5 df =1 
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In this part of the experiment, also, the 
drug addict pairs resemble Sherif’s normal 
pairs (3), in that the median difference be- 
tween pair-members’ estimates when judging 
alone after judging together was about 0.50 
of an inch for both Sherif’s group and the 
drug-addict group. 


Discussion of Results 


The results support the hypothesis that 
schizophrenic individuals modify their re- 
sponses less in accordance with the responses 
of another person, when these responses are 
communicated in an interpersonal situation, 
than do nonschizophrenic individuals. 

The subject’s estimates of light movement 
when judging with another person, might be 
considered, from a psychoanalytic point of 
view, to reflect the adequacy of the subject’s 
ego functioning. The person whose ego func- 
tions adequately would be expected, in the 
ambiguous autokinetic situation, to utilize the 
only frame of reference available besides his 
own—the responses of the other person—as a 
guide in making his own subsequent esti- 
mates. The results indicate that the schizo- 
phrenic has difficulty in utilizing such an ex- 
ternal frame of reference. 

The experimental evidence would suggest, 
further, that the schizophrenic does not ade- 
quately internalize the points of view of 
others. Since the superego is that aspect of 
the personality that is largely developed 
through a continuous process of introjection, 
these results suggest that, in the schizo- 
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phrenic, not only ego functioning, but also 
superego functioning is affected. 


Summary 


A variation of the autokinetic experiment 
was employed to test the hypothesis that, 
as compared to nonschizophrenic emotionally 
disturbed individuals, schizophrenics are de- 
ficient in the ability to modify their responses 
in the direction of the responses of another 
person. Thirty hospitalized male schizophren- 
ics and thirty male drug addicts judged the 
apparent amount of movement of a station- 
ary point of light in a darkened room, first 
when alone, then with another individual 
from his group, and then when alone again. 

The results supported the hypothesis. When 
estimating light movement in the presence of 
another person, the convergence of the schizo- 
phrenic pairs was significantly less than that 
of the nonschizophrenic drug-addict pairs. 
When responding alone, after having re- 
sponded in the presence of the other person, 
the convergence of the schizophrenic pairs 
again was significantly less than that of the 
nonschizophrenic drug addict pairs. 
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Concept Learning in the Educationally Retarded 
Child of Normal Intelligence 


Nissim M. Levy and Joseph M. Cuddy 


Brown University 


The report presented here is part of a 
larger project concerned with the develop- 
ment of a technique for predicting which chil- 
dren of normal intelligence, as measured by 
the usual methods, are likely to develop learn- 
ing difficulties. The commonly-used measures 
in this area are estimates of intelligence, read- 
ing readiness, and social and physical devel- 
opment. Unfortunately these measures, except 
when they are fairly low, are relatively poor 
predictors. 

Since most of the important tasks that chil- 
dren must master in school involve learning, 
a good instrument for estimating rate of 
learning may be of considerable value for 
evaluation purposes. This approach to the 
problem may be especially fruitful since rate 
of learning is very likely a complex function 
of not only intelligence, but also strength of 
motivation to learn, and reactions to failures 
during the acquisition process. 

A careful examination of typical intelli- 
gence tests leads one to the fact that most 
of the material depends heavily upon the 
achievements of the individual. Of the WISC 
subtests, for example, none of them with the 
possible exception of the coding allow the sub- 
ject an opportunity to improve performance 
during the assessment situation. This behav- 
ior sample is probably too small to be ade- 
quately used as a learning measure. However, 
it is interesting to note that Altus (1) re- 
ported consistently low scores on this subtest 
in records of 24 severe cases of educational 
retardation. 

The present study was designed to answer 
the following question: Given a group of chil- 
dren of normal intelligence some of whom are 
underachievers in school, do underachievers 
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differ significantly from normal achievers in 
ability to solve a concept-formation problem? 
A concept-formation problem was chosen for 
the first investigation on the assumption that 
it might provide the most sensitive measure 
of impairment in learning attributable to mo- 
tivational and emotional difficulties. 


Method and Procedure 


The Ss were 23 pairs of fourth-grade ele- 
mentary school children matched for age, sex, 
and, roughly, for socioeconomic status of par- 
ents.’ The Ss’ IQs ranged from 98 to 103 as 
measured by the California Test of Mental 
Maturity. One student of each pair was work- 
ing up to grade placement while the other 
was behind from .5 to 2.5 years in achieve- 
ment as measured by the Stanford Achieve- 
ment Battery. 

The learning task used for the study was a 
variation of the Oddity problem. Ten sets of 
Vigotsky Blocks were used. Each set con- 
sisted of three wooden blocks. In each set, 
the correct block was the odd one in terms of 
one of three possible dimensions. The three 
dimensions used were size, color, and shape. 

Each S$ was told that his problem was to 
find out how to tell which was the correct 
block each time he was presented with a dif- 
ferent set. If S chose the correct block, he re- 
ceived a chip. If he chose an incorrect block, 
E said, “That’s not right,” and the set of 
blocks was removed and another set pre- 
sented. At the end of the game he received 
a penny for each five chips he had won. 


1The authors wish to thank Dr. James Hanley, 
Superintendent of the Providence City Schools, and 
Miss Mary Lauth for their cooperation and assist- 
ance in making this study possible. 
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Five blocks of ten trials were given to each 
S, each block consisting of a random order of 
the ten different sets. If S chose correctly on 
11 consecutive trials, testing was discontinued 
and S was credited with correct choices for 
the balance of the 50 trials. Testing was con- 
tinued even when S spontaneously verbalized 
the correct principle before the end of the 50 
trials. At the end, all Ss were asked to ver- 
balize how they solved the problem. 

The measures of learning used were: (a) 
number of errors; (2) number oi trials to cri- 
terion; and (c) verbalization of the Oddity 
principle. 


Results 


Errors. In 18 out of the 23 pairs of Ss in 
the study, the normal achiever made less 
errors than his or her matched underachiever. 
Unless otherwise indicated all results reported 
were tested for significance by a two-tailed 
application of the binomial expansion. The 
probability of getting 18 out of 23 differences 
in one direction is less than the .01 level. 

Trials to criterion. Eleven Ss reached the 
criterion of 11 consecutively correct choices 
within 50 trials. Of these 11 Ss, 10 were nor- 
mal achievers while only one was an under- 
achiever. This result yields a p value of less 
than .01. 

Correct verbalization of the “Oddity” prin- 
ciple. Sixteen Ss correctly verbalized the prin- 
ciple. Of these 16 Ss, 14 were normal achievers 
while two were underachievers. This result 
yields a p value of less than .01. 

Ten Ss met both criteria. All 10 of these 
Ss were normal achievers. Again this result is 


Table 1 
Median Test for Discrepancy Scores of 
Matched Subjects 
Median 
Above or 
Condition median below 

Normal achiever reached criterion; 
underachiever did not. 10 3 
Neither S reached criterion. 2 8 


Exact p value, .012 
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significant at well below the .01 level of con- 
fidence. 

Using either criterion, 13 of the 23 pairs of 
Ss had a difference in favor of the normal 
achiever, while no difference occurred in 10 
pairs. Inspection of the data suggested the 
presence of a relationship between degree of 
discrepancy in achievement between the nor- 
mal achiever and the underachiever of a given 
pair, and the likelihood of a difference occur- 
ring in either criterion in favor of the nor- 
mal achiever. Therefore the difference score 
in achievement level for all of the pairs of Ss 
was calculated. The over-all median of the 23 
discrepancy scores was calculated and the 
data analyzed by means of the nonparametric 
median test. The contingency table arising 
from this analysis is presented in Table 1. 
Fisher’s exact method for testing the signifi- 
cance of contingency tables with small fre- 
quencies was used and resulted in a p value 
of .01, confirming the suggestion of a rela- 
tionship between the degree of underachieve- 
ment and the likelihood of a difference in 
favor of the normal achiever. 

It should be pointed out that two-tailed 
tests are conservative estimates of significance 
in this study since the implicit hypothesis 
being tested calls for a prediction of a di- 
rectional difference. We would predict from 
both an empirical and theoretical basis that 
the underachievers would do less well. 


Discussion 

The results of the study are quite clear. 
Every measure of learning used demonstrated 
a significant difference in performance be- 
tween our two groups. The finding that pairs 
of Ss with higher discrepancy scores more 
often yielded a difference in criterion meas- 
ures is further support for the hypothesis that 
underachievers learn more slowly. 

Since 29 of the 46 Ss failed to meet either 
criterion it is quite likely that the limit of 50 
trials was too low. An examination of the ver- 
balization of the Ss who did not solve the 
problem indicated that many of them were 
at least on the way toward solution. When 
asked how they tried to solve the problem, 
some of them indicated partial solution by 
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suggesting that it was the smaller one or 
larger one or the one that was a different 
shape. The Ss were credited with the correct 
solution only if they indicated all three di- 
mensions or simply said that the correct block 
was always different from the other two. 

A difference in the rate of learning of a 
concept has been demonstrated in this study. 
The question arises as to whether this differ- 
ence is restricted to the conceptual area or 
whether it may be a more generalized deficit. 
In an attempt to answer this question the 
same children are being used as Ss in a num- 
ber of other types of learning problems such 
as pursuit-rotor, stylus maze, paired associ- 
ates, mirror drawing, discrimination, and 
problem boxes. 

Whether we find underachievers demon- 
strating a general deficit in learning rate or 
whether it turns out to be primarily related 
to the verbal or conceptual areas the question 
will still remain whether or not we can de- 
tect these potential underachievers in pre- 
school or the first grade. 

Much of the thinking of clinicians con- 
cerned with the problem of the underachiever 
suggests that in many cases the familial in- 
terpersonal constellation and early history of 
the child play a considerable role in develop- 
ment of this condition. It would be reason- 
able to assume then that these children have, 
by the time they reach school age, already 
developed those patterns of behavior which 
will result in underachievement. This sug- 
gests that we can hope to find a learning-rate 
difference in children of normal intelligence 
by the time they reach school age which may 
be predictive of later underachievement in 
school. 

It must be emphasized that what we are 
suggesting is not the use of learning tasks as 
a substitute for the standard intelligence 
scales. Hodges (4) following a lead of Hunt’s 
(5) reported a study in which he estimated 
mental age from a variety of measures of 
performance in a double-alternation problem. 
The correlations which Hodges obtained 
varied between .22 and .74. It should be 
clear from his results that these two tech- 
niques are different. 


Binet, it will be recalled, defined intelli- 
gence as the ability to profit from experience. 
Most intelligence tests use the same assump- 
tion that Binet used in order to justify them- 
selves. This assumption is that all subjects 
measured by the particular instrument had 
the same opportunity for the experiences 
which are relevant to the material being used 
as the instrument. That this assumption is a 
weak one has adequately been pointed out 
before (2, 3). In part it is the weakness of 
this assumption which makes the use of such 
tests relatively inadequate for the prediction 
of academic success. Perhaps if Binet had 
worked on his problem after Thorndike, the 
tests which he would have developed might 
have been more along the lines of the work 
we are suggesting. 


Summary 


Twenty-three pairs of fourth-grade children 
matched for age, sex, and very roughly for 
socioeconomic status served as Ss in a con- 
cept-learning task. The Ss’ IQs ranged from 
98 to 103 as measured by the California Test 
of Mental Maturity. One S of each pair was 
working up to grade placement while the other 
was behind from .5 to 2.5 years in achieve- 
ment as measured by the Stanford Achieve- 
ment battery. All Ss were given a maximum 
of 50 trials within which to reach the criterion 
of 11 consecutively-correct choices. A varia- 
tion of the “Oddity” problem was used. At 
the end of testing, all Ss were asked to ver- 
balize how they attempted to solve the prob- 
lem. The measures of learning were: (a) num- 
ber of errors: (5) number of trials to cri- 
terion; and (c) correct verbalization of the 
oddity principle. 

In 18 out of 23 pairs of Ss, the normal 
achiever made fewer errors than his matched 
underachiever. Eleven Ss reached the cri- 
terion of 11 consecutively correct choices. Six- 
teen Ss correctly verbalized the principle. 
Fourteen were normal achievers while only 
two were underachievers. Ten Ss met both 
criteria and all 10 of these Ss were normal 
achievers. All of these results were statis- 
tically significant below the .01 level of con- 
fidence. 
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Implications of these results were discussed the Chicago studies of cultural bias in intelli- 

and the direction of further needed research gence tests. Harvard educ. Rev., 1953, 23, 384- 
397. 

was suggested. 3’. Davis, A. Social-class influence upon learning. 
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Antisocial behavior may be viewed in terms 
of disequilibrium between an impulse system 
and a control system, the latter involving as- 
sessments of reality consequences as well as 
internalized standards serving as restrictions 
upon behavior. The present study is designed 
to examine certain disturbances of the im- 
pulse-control balance as they are manifested 
in TAT productions. Lindzey has referred to 
the “determination of the conditions under 
which inferences based on the projective ma- 
terial directly relate to overt behavior and 
the conditions for the reverse” (2, p. 18) as 
one of the major interpretive problems for 
the TAT. Studies examining the relationship 
between fantasy and overt behavior have 
yielded varied results (5, 9). Neither Murray 
(5) nor Sanford et al. (9) found any sub- 
stantial direct relationship between the in- 
tensity of fantasy aggression and its overt 
expression. However, in explaining their re- 
spective results, each hypothesizes a factor of 
cultural prohibition serving to prevent the 
overt gratification of antisocial needs. In or- 
der to circumvent the complicating factor of 
middle-class inhibitions regarding aggressive 
behavior, Mussen and Naylor (6) studied a 
group of lower-class boys whose culture pre- 
sumably encourages rather than inhibits overt 
aggression. They found strong support for a 
positive relationship between fantasy aggres- 
sive needs and overt aggressive behavior. 
Moreover, they found suggestive evidence that 
the amount of punishment anticipated relative 
to aggressive needs was an important deter- 


1 The author wishes to express his appreciation to 
Richard Besnia, Carl Ejisdorfer, Marvin Greenstein, 
Maurice Kouguell, and Dr. Philip Kubzansky who 
collected the data for this study and assisted in its 
statistical evaluation. 

2 Now at the University of Kentucky. 


449 


minant of overt aggression. With the excep- 
tion of Mussen and Naylor’s work, previous 
investigations attempting to correlate covert 
needs and overt behavior have tended to neg- 
lect the role of inhibiting or defensive forces 
in modifying the behavioral resultant of covert 
needs. Mussen and Naylor, however, made no 
effort to distinguish between external and in- 
ternal controlling forces. 


Hypotheses 


The major hypotheses of the present study 
may be stated with reference to different 
facets of the impulse-control balance: 


A. Impulse System 


1. There is a direct relationship between 
degree of aggressive, antisocial behavior, and 
the number of aggressive thoughts manifested 
in TAT stories. 

2. The form as well as the number of TAT 
aggressions is related to the extent of overt 
aggressive behavior. More aggressive indi- 
viduals are likely to express TAT aggressions 
crudely and directly, relatively unhampered 
by repressive processes. 


B. External Control System 


3. Overt antisocial behavior varies inversely 
with the anticipated amount of fantasy pun- 
ishment from an external source. 

4. Among antisocial individuals themes of 
external punishment are more likely to serve 
as justifications for aggression than among 
nonantisocial individuals. This hypothesis is 
derived from Redl and Wineman’s (8) de- 
tailed account of the “delinquent ego.” By 
this is meant the ego whose every effort is 
bent toward obtaining guilt-free and anxiety- 
free enjoyment of delinquent impulsivity. 
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C. Internal Control System 


5. Overt antisocial behavior varies inversely 
with the anticipated amount of fantasy pun- 
ishment from an internal source. 

6. Of the two types of punishment, internal 
and external, anticipated internal punishment 
is the more significant inhibitor of antisocial 
behavior. 


Method 


Fifty-seven male Army trainees between the 
ages of 17 and 26 with a mean age of 20 were 
used as subjects (Ss) for this experiment. All 
were psychiatric referrals to the Mental Hy- 
giene Clinic for whom psychological testing 
was requested. The only patients excluded 
were those suffering from organic brain dam- 
age or those diagnosed as psychotic. 

Each S was seen individually by one of 
three clinical psychology specialists who ad- 
ministered cards 1, 3BM, 4, 6BM, 7BM, 
8BM, 12M, 13MF, 14, and 18BM of the 
TAT. A fourth clinical psychology specialist 
and the author, neither of whom had adminis- 
tered any of the TATs, independently scored 
the TAT protocols for measures of aggressive 
impulses with respect to both quantity and 
quality, anticipated punishment from ex- 
ternal agencies, and anticipated punishment 
from within. 


The criteria used for identifying fantasy aggres- 
sions and punishments were based on those described 
by Mussen and Naylor (6) with certain modifica- 
tions. A fantasy aggression score was derived for 
each S by counting the frequency of instances of all 
aggression such as the following: fighting, criminal 
assault, getting angry, criticizing; running away, re- 
sisting coercion, being negativistic, lying, cheating, 
stealing, dominating or restraining someone, rejecting 
someone. The occurrence of death, illness, or acci- 
dent to parents or other loved objects in a story was 
also regarded as fantasy aggression. Since the present 
study is concerned only with extrapunitive aggres- 
sion, such items as suicide, self-injury, and self-de- 
preciation were not scored as fantasy aggression. The 
Pearson r, used as the reliability measure in every 
case, showed the correlation between independent 
scorings to be .91. 

An external punishment score was obtained by 
summing the frequency of such themes as the fol- 
lowing when they were directed toward the hero: 
assault, injury, threat, quarreling, deprivation of some 
privilege, object or comfort, domination, physical 
handicap such as blindness, etc., rejection. Scoring 
reliability was .81. 


Internally based punishment included suicide, self- 
depreciation, and feelings of guilt, shame, or remorse. 
Since a broad definition of punishment encompasses 
“injury to a loved object,” instances of death, ill- 
hess, or accident to parents or other loved objects 
were recorded in this category of internal punish- 
ment. In instances in which the hero of the story was 
the direct agent of aggression, e.g., husband striking 
his wife, internal punishment was not scored except 
where there was a clear expression of remorse or 
shame. By contrast, accidental injury, death, or ill- 
ness of a loved one always received a scoring of in- 
ternal punishment unless the hero specifically denied 
anything resembling guilt feelings. Scoring reliability 
was .89. 

Remoteness of fantasy aggression from overt be- 
havior was rated on a seven-point scale with one 
representing the most direct form of fantasy expres- 
sion. For each instance of fantasy aggression counted 
in the TAT story, a remoteness tating was made. 
The following considerations were kept in mind when 
evaluating remoteness: (a) object of aggression (per- 
son, society, animal, inanimate object), (6) time 
(present, past, future), (c) place (customary habitat, 
other countries, other planets), (d) level (behavior, 
wish, memory, daydream, nightdream, special states 
of consciousness such as intoxication, dissociation, 
drug addiction, insanity), (e) social context and in- 
stigator (hero instigates aggression, merely tags 
along with crowd, or even has nothing directly to 
do with aggression, e.g. death of someone due to dis- 
ease or accident). The reliability for the number of 
one and two remoteness ratings scored per S was 
.96. 


Measurement of aggressive antisocial be- 
havior was based on the social history data 
routinely obtained by the psychiatric social 
work specialists. Such information included 
a record of S’s military delinquencies, e.g. 
AWOL, refusal to obey orders, assault, etc., 
as well as S’s report of any delinquent civilian 
behavior, e.g., truancies, gang fighting, steal- 
ing, peddling of drugs, etc. The military in- 
formation was, of course, easily verifiable but 
the data pertaining to premilitary activities 
were not, except in certain instances in which 
records were requested from reformatories or 
other institutions mentioned by S. Several 
factors, however, indirectly testified to the 
substantial accuracy of Ss’ accounts of their 
civilian behavior. First, the civilian and mili- 
tary behavior data were almost invariably 
consistent with respect to delinquent charac- 
teristics. Nevertheless, it was important to 
use the premilitary history to get a long-term 
picture of S’s characteristic aggressive pat- 
tern, particularly for those individuals whose 
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time in the military was relatively brief. Sec- 
ond, a subsequent check of psychiatric diag- 
nosis revealed no case in which there was 
gross discrepancy between the clinical impres- 
sion of the psychiatrist and S’s report of his 
aggressive behavior pattern. Third, on every 
occasion in which independent confirmation 
of civilian behavior was available (15 cases), 
S’s own account was corroborated. 

Data pertinent to the variable of aggressive 
behavior were extracted from each S’s social 
history by the three clinical psychology spe- 
cialists who had been responsible for testing. 
The author and one of these specialists rated 
the behavior extracts for aggressivity on a 
seven-point scale with one representing the 
least degree of aggressive behavior. The pos- 
sibility of knowledge of test scores influenc- 
ing behavior ratings or vice versa was negli- 
gible for two reasons. First, the specialist who 
served as a judge to check the reliability of 
the behavior ratings had administered only 
25 per cent of the total number of TATs, and 
second, the behavior ratings were accom- 
plished two months prior to the TAT scor- 
ing. Sample extracts of the extremes of the 
scale are presented below. 


Rating of 1: Steady school attendance record 
through fifth term high school. After leaving school 
worked his way up to level of assistant pressman in 
print shop at $110 per week. He was drafted from 
a firm which had employed him since 1948. Married 
and living with wife and child for three years. No 
military or civilian delinquencies reported. 

Rating of 7: Long history of truancies, delin- 
quency and classroom misbehavior. In reform school 
four years. Prior to induction was confined two 
years eight months for armed robbery. While AWOL 
came in possession of drugs and jailed one month. 
Assaulted several people. Enlisted in Army on advice 
of parole officer to get reduction of sentence. History 
of eight AWOLs and four courts martial. Now fac- 
ing a general court martial, five years in prison, and 
a dishonorable discharge because he wrecked stock- 
ade dispensary. 


The reliability of these behavior ratings 
was .91. In order to assure large enough 
groups for comparison the seven-point scale 
was then consolidated into a three-point scale, 
combining ratings of one and two; three, four 
and five; and six and seven. Hereafter, Group 
I of 24 Ss refers to the least antisocial group, 
Group II of 19 Ss to the intermediate, and 


Table 1 
Product-Moment Intercorrelation of TAT Variables 


Remoteness Int. Pun. Ext. Pun. 


FA FA FA FA 
3° 
-45 38 
-65 -—.13 89 01 


* An r of .36 is required for significance at the 1% level. 


Group III of 14 Ss to the most antisocial 
group. 
Results 


Since the interpretation of results depends 
in some degree upon the relative independence 
of the five TAT variables their intercorrela- 
tions are presented in Table 1. Most of the 
relationships are statistically significant as 
seems logically required by the fact that com- 
plementary aspects of the impulse-control fac- 
tor of personality are represented. However, 
in nearly all instances, the amount of vari- 
ance in one variable that can be accounted 
for by the variance in another (as measured 
by r*) is very low to moderate. Therefore, 
each of the variables may be said to con- 
tribute some unique information beyond their 
common meaning. 

As shown in Table 2 the analysis of vari- 
ance indicates over-all significant differences 
between the means of the three groups. This 
does not tell us, however, how well the three 
groups are separated. Therefore, before de- 
scribing these results in detail, more precise 
information on the differences between the 
groups is presented in Table 3. Two-tailed 
tests of significance were used. It is apparent 
that in nearly all instances each group is sig- 
nificantly different from the other two. 

The difference between the means of the 
three groups in number of fantasy aggression 
themes, as shown in Block A of Table 2, is 
highly significant beyond the value of p= 
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Table 2 
Analysis of Variance for Three Groups Differing in Degree of Antisocial Behavior 


Mean 


TAT Variables 


F 


Variance* 


. Fantasy Aggression 
Group I 6.08 
Group II 8.05 
Group III 


. Number of 1 and 2 
Remoteness Ratings 


Fantasy Aggression 


Between Groups 136.69 23.98 .001 
Within Groups 5.70 


Group I .10 Between Groups 71 35.50 001 
Group IT 17 Within Groups 02 
Group IIT 


. External Punishment 
Fantasy Aggression 

Group I 87 

Group II .70 

Group ITI 


. Internal Punishment 
Fantasy Aggression 
Group I 79 


Group IT 32 
Group III 


. Interna] Punishment 
Externai Punishment 


Between Groups 33 3.66 05 
Within Groups 09 


Between Groups 2.88 20.57 001 
Within Groups 14 


Group I 1.03 Between Groups 3.63 12.96 001 
Group II 52 Within Groups .28 
Group ITI 


Within Groups df = 54. 


* Between Groups df = 2. 


.001. Since several authors have suggested 
the importance of social-class sanction as a 
factor governing the overt expression of covert 
aggressive needs, this variable was explored 
within the limits of the data. The bulk of 
the antisocial group was of lower-class back- 
ground whereas the nonantisocial group was 
predominantly middle class (,? significant at 
5% level). 

To press the examination further, the Ss in 
Groups I and III (least and most antisocial) 
were then separated according to social class. 
Using the wu statistic described by Lord (3), 
the significance of the difference in number of 
aggressions for middle-class Ss only was cal- 
culated. The same was then done for lower- 
class members of Groups I and III. Group 
III Ss produced significantly more aggressive 
themes than Group I Ss regardless of social 


class. Although the number of Ss on whom 
these comparisons are based are relatively 
few (15 middle-class Ss in Group I and 5 in 
Group III, 6 lower-class Ss in Group I and 9 
in Group III), the p values were .02 and .01. 
These several findings confirm the validity of 
Hypothesis 1 which concerns the relationship 
between aggression in fantasy and behavior. 

To evaluate Hypothesis 2 the ratio of one 
and two remoteness ratings to total fantasy 
aggression score was compared for the three 
groups. The results, shown in Block B of 
Table 1, reveal the difference between the 
means to be significant well beyond » = .001. 
Not only does the antisocial individual re- 
spond with more aggressive themes, but the 
quality of his fantasy aggression is far more 
direct and undisguised than is true for the 
nonantisocial individual. 
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Block C of Table 1 compares the ratio of 
external punishment to fantasy aggression 
for the three groups. The significance of the 
difference between the means attains a value 
of p= .05. Thus an important force inhibit- 
ing the nonantisocial group appears to involve 
some sort of anticipated punitive action from 
without, giving support to Hypothesis 3. 

Hypothesis 4 calls for a closer look at the 
fantasied external punishments. These were 
separated into three categories according to 
the following principles: (a) external punish- 
ment occurring subsequent to fantasy aggres- 
sion and as a direct consequence of such 
aggression, e.g., “he was sent to jail for steal- 
ing”; (6) external punishment occurring prior 
to fantasy aggression and used as a justifica- 
tion for such aggression, e.g., “she was mean 
to him so he hit her”; (c) external punish- 
ment having neither of the above relationships 
to fantasy aggression, e.g., “she held him 
back because she was afraid he might get 
hurt.” The total number of external punish- 
ments is larger for Group III than Group I 
with = .05. It is clear, however, that TAT 
themes of external punishment are far more 
frequently utilized as excuses for aggression 
by antisocial members of Group III (p= 
.001), thus confirming Hypothesis 4. The 
mean number of themes classified in this 
manner was 3.64 for Group III, 1.25 for 
Group I. External punishment themes as a 
direct consequence of aggression are also more 
common among Group III Ss although the ¢ 
value of 1.89 falls just short of significance. 
Since the number of postaggression punish- 
ment themes is obviously dependent upon the 
number of fantasy aggressions, an attempt 
was made to compare retaliatory punishment 
themes on a proportional basis. The data 
showed that the mean ratio of retaliatory 
punishment themes to fantasy aggression is 
identical for Groups I and III so that it was 
not even necessary to calculate a significance 
statistic. Interestingly enough, the number of 
external punishment themes unrelated to fan- 
tasy aggression is larger among the nonanti- 
social members of Group I (p = .01). 

Block D of Table 1 indicates that the dif- 
ferences between the group means, with re- 
spect to the quantity of guilt and shame 
themes proportional to aggression, is signifi- 
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Table 3 


t Ratios of Group Differences * 


wic Sle Sle 
Groups “ole § Sif sis 
compared Sola Ale ale 
wie Sie Sia 
Group I 
vs. 3.18 1.75 177 3.36 2.83 
Group IT 
Group II 
vs. 417 5.33 1.09 600 4.07 
Group IIT 
Group I 
vs. 6.04 7.84 2.27 444 4.40 
Group III 
* Minimum f¢ ratios required for significance: 1.69 = <.10 
2.04 = <.05, 2.75 = <.01, 3.64 = <.001. 


cant at the value » = .001. Since the anti- 
social Ss produced significantly more aggres- 
sive themes than the nonantisocial Ss, one 
may wonder whether this alone is not the 
crucial determinant in creating differential 
values of the ratio of internal punishment to 
fantasy aggression. If one were able to find 
individuals in Groups I and III with aggres- 
sive impulses of similar strength, then the 
effect of internal punishment expectations 
could be independently evaluated. Overlap- 
ping in the number of fantasy aggressions 
between Ss of Groups I and III permitted 
comparison of six members from each group. 
They were the six highest and lowest in 
Groups I and III respectively in number of 
fantasy aggression themes. The w statistic 
shows no significant difference between the 
two groups with respect to fantasy aggres- 
sion. However, the proportion of internal pun- 
ishment to fantasy aggression is much larger 
for the nonantisocial group (p = .001). The 
mean ratio was .11 for the six Group III 
members and .54 for the six Group I mem- 
bers. 

The relationship between socioeconomic class 
and antisocial rating raises a similar question 
regarding internal inhibition as was posed for 
fantasy aggression. Does the difference in 
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number of internal punishment themes for 
antisocial and nonantisocial groups hold true 
irrespective of social class membership? The 
u statistic shows clearly that Group I Ss pro- 
duce significantly more internal punishment 
themes than Group III Ss regardless of social 
class. The difference between the middle-class 
members of Groups I and III is significant 
beyond the .001 level and among lower-class 
members of the two groups the difference 
achieves significance at the .01 level. 

As shown in Block E of Table 1, the ratio 
of internal to external punishment themes 
differs significantly for the three groups (p 
= .001). The much higher ratio for Group I 
Ss suggests that internal punishment antici- 
pations are a more potent deterrent to anti- 
social behavior than are external punishment 
anticipations, supporting Hypothesis 6. 


Discriminative Power 

Since a widely used diagnostic instrument 
was employed in this study it was deemed 
desirable to obtain some measure of diagnos- 
tic power or forecasting efficiency as distin- 
guished from statistical significance. In order 
to evaluate discriminative power accurately 
it is necessary to make three comparisons of 
overlapping scores: Group I with Group II, 


Table 4 


Discriminative Power of Indicators Obtained 
from the TAT * 


GroupsI GroupsII Groups I 


Variable & Il & It & Ill 
Fantasy Aggression 68 67 76 
Number of 1 & 2 
Remoteness Scores 58 82 90 
Fantasy Aggression 
External Punishment 58 66 
Fantasy Aggression 
Internal Punishment 
Fantasy A To 77 88 97 
Internal Punishment 


* Numbers in table refer to percentage of cases correctly 
classified. 


Group II with Group III, and Group I with 
Group III. In each instance the median of the 
overlapping scores of the two populations be- 
ing compared was selected as the cutoff point. 
Thus there were three different cutoff points 
for three separate comparisons. The percent- 
age of correct classifications is calculated 
from the number of cases in each two-group 
comparison falling on the predicted side of 
the overlap median score. The data in Table 
4 suggest that the most discriminating vari- 
ables are the two internal punishment ratios. 
In clinical practice, however, it would seem 
appropriate to approach this classification 
problem by judging the interrelationship 
among the factors. For example, one of the 
members of Group I had a high fantasy ag- 
gression score but fell well within the non- 
antisocial group in all other scoring cate- 
gories. 
Discussion 

Since the finding of a positive relationship 
between fantasy and aggressive behavior held 
true for Ss of the middle as well as the lower 
class some attempt at explanation seems war- 
ranted in view of the contrary results of San- 
ford et al. (9) and Murray (5). The range in 
severity of aggressive behavior among Ss in 
the present study seems far greater than was 
true for either of the previous studies and 
this, of course, made for an easier discrimi- 
nation. Furthermore, it is the author’s im- 
pression, also suggested by Redl (8) and 
Feshbach (1), that certain individuals who 
characteristically behave in impulsive, anti- 
social fashion not only find fantasy an un- 
satisfactory substitute for direct action but 
may actually be stimulated and provoked to 
act out by a succession of onrushing hostile 
images. 

The association between lower-class mem- 
bership and antisocial behavior is perhaps 
partially reflective of the freer rein given ag- 
gressive impulses in this group as suggested 
by previous authors. It is important to note, 
however, that when antisocial behavior does 
occur, whether it be among middle- or lower- 
class Ss, these individuals manifest stronger 
aggressive impulses and/or a weaker internal 
control system than nonantisocial individuals. 
Thus the distinguishing feature is the indi- 
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vidual impulse-control balance, although spe- 
cific cultural groups may tend to foster vary- 
ing types of impulse-control balance with 
respect to aggression. 

The analysis of remoteness of aggression is 
related to the problem of symbolic repre- 
sentation of impulses and conflicts in projec- 
tive data. This investigation demonstrated 
that antisocial Ss were more direct in their 
expression of aggression than nonantisocial 
Ss. It appears, then, that defensive inhibitions 
alter not only the sheer quantity of aggres- 
sive themes but also the way in which they 
are formulated. This is a common enough 
clinical observation but one which does not 
seem to have been given sufficient weight in 
experiments concerned with the problem of 
inferring behavioral tendencies from projec- 
tive material. 

Anticipation of punishment as a conse- 
quence of aggressive behavior was no more 
prominent among nonantisocial Ss than anti- 
social Ss. This is not’ to say that such ex- 
pectation may not serve to deter antisocial 
action in many instances. However, the re- 
sults do suggest that it is not one of the criti- 
cal distinctions between the two groups. On 
the other hand, a significantly larger number 
of punishment themes unrelated to aggression 
either as retaliation or justification was char- 
acteristic of Group I Ss. The content of 
themes classified in this manner suggests a 
greater awareness and acceptance of external 
direction and restraint. This finding may re- 
late to the demonstration by Pastore (7) of 
the importance of perceived arbitrariness of 
the frustrating situation as a determinant of 
aggressive reaction. 

As noted earlier in this paper there is a 
marked tendency for antisocial Ss to justify 
their aggressive reactions by pointing to the 
harsh treatment imposed on them by others. 
If antisocial individuals can prove that the 
world is against them, then there is no need 
to recognize any obligations to others. The 
very fact that they so frequently find it nec- 
essary to resort to this projection technique 
argues that the delinquent ego is busily en- 
gaged in disposing of still existing remnants 
of guilt. One of the implications of this state 
of affairs is that any general policy of retalia- 
tion only reinforces this defensive maneuver 
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and makes it easier for the delinquent to pur- 
sue his antisocial course on a guilt-free basis. 
The basic importance of internal controls in 
the inhibition of antisocial behavior is amply 
demonstrated by the data. Certainly the eva- 
sion of guilt, shame, or remorse is a far more 
vital mark of the delinquent than any lack of 
awareness concerning potential external pun- 
ishment consequent to antisocial behavior. 

Another pertinent topic involves the hy- 
pothesis that fantasy behavior has a com- 
pensatory function. Tomkins (10) assumes 
this hypothesis in stating that TAT aggres- 
sive fantasies are found more commonly 
among neurotics and normals than antisocial 
characters. The results of the present study 
suggest that the opposite is the empirical fact, 
given Ss differing widely in antisocial be- 
havior. However, this finding is not neces- 
sarily contradictory to the fantasy substitu- 
tion hypothesis. One possibility is that the 
more potent and numerous aggressive im- 
pulses of the antisocial Ss obscure any rela- 
tive increment in fantasy aggression in the 
nonantisocial Ss. An alternative suggestion 
mentioned earlier is that fantasy is not uni- 
versally employed as a defensive or adaptive 
measure. Clinical experience tells us that in- 
dividuals differ widely in the use of fantasy 
and it may simply be that many directly 
hostile individuals have not learned to resort 
to fantasy for substitute gratification. 


Summary 


This study has relevance to two problems. 
Its first interest was to examine certain per- 
sonality variables, particularly in terms of 
the impulse-control balance, in order to es- 
tablish their relationship to overt aggressive 
tendencies. The TAT was the instrument 
through which information was gathered 
about the nature of and interrelationships 
between aggressive impulses, anticipated in- 
ternal punishment, and anticipated external 
punishment. The second major issue con- 
cerns the relationship between fantasy and 
reality behavior and the task of formulating 
from projective data alone the various cues 
which might permit differential predictions 
about behavior tendencies insofar as overt 
expression is concerned. 


t 
1 
S 
- 
d 
0 
1- 
it 
to 
le | 
n- \ 
ps 
ed 
te, 
eS 
er- 
yer 
nal i 
ils. 
di- 


456 Kenneth Purcell 


Fifty-seven young adult soldiers, the large 
majority of them basic trainees and all of 
them psychiatric referrals to the Mental Hy- 
giene Clinic, served as Ss. On the basis of 
social history data covering aggressive and 
antisocial behavioral manifestations in both 
civilian and military life, each S was rated as 
to overt antisocial tendencies. It was found 
that antisocial Ss differ from nonantisocial Ss 
in quantity and quality of fantasy aggressions, 
the former producing not only a larger num- 
ber of fantasy aggressions but also finding it 
less necessary to obscure or minimize the 
hostile impulse. The superego variable as 
measured by anticipated internal punish- 
ments was found to be of far greater sig- 
nificance in inhibiting antisocial behavior than 
fear of retaliatory punishment which did not 
even differentiate between groups. An inter- 
esting finding dealt with the tendency of 
antisocial Ss to utilize themes of external 
punishment as a justification for their fan- 
tasy aggressions. Some of the implications of 
these results were discussed. An effort was 
made to deal with the problem of socioeco- 
nomic class factors as they might relate to 
the data. Finally, emphasis was placed on the 
fact that, for behavioral prediction from pro- 
jective material, the same attention must be 
given to repressive, inhibitory forces mani- 


fested in the data as is given to the impulse 
or drive system. 
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Situational Influences on Rorschach Responses 


Edith M. Henry and Julian B. Rotter 
The Ohio State University 


The widespread use of “projective” tests for 
purposes of diagnosis and prediction of be- 
havior has stimulated extensive research in- 
vestigating the validity of the underlying as- 
sumptions of these tests. One of the most 
significant areas in which research has raised 
questions is that of the effect of immediate 
influences on projective test productions. In 
social learning theory (14, 15) such influ- 
ences are conceptualized as the effect of the 
psychological situation on behavior. 

A growing body of literature demonstrates 
that many factors other than an individual’s 
basic personality structure are operating in 
any test situation, and that these factors are 
important determiners of the types of re- 
sponses to be given. That is, individuals adapt 
their behavior to the total environment in 
which they find themselves, and external 
stimuli as well as inner motivations and dy- 
namics shape behavior at any given time. 
The present study is an attempt to show 
further the importance of situational factors 
and to predict the types of changes in re- 
sponse which will occur as a result of a par- 
ticular type of change in the testing situation 
- utilizing the Rorschach Test. 

More specifically this study is concerned 
with the subject’s (S’s) belief regarding the 
purpose of the test as it affects his test re- 
sponses. A number of previous investigations 
(3, 4, 7, 9, 10, 11, 12, 16) have dealt with 
other kinds of situational influences on the 
Rorschach test, including such variables as 
the effect of immediate stress, the sex and 
personality of the examiner, suggestion, and 
the physical place of testing. There have also 
been a number of studies more closely re- 
lated to the present one. 


Fosberg (5, 6) found little consistent effect of 
asking Ss to make the best possible and the worst 


impression without, however, clarifying how they 
should do this. Carp and Shavsin (2) however in a 
similar study found intra-individual differences al- 
though there were no statistical group differences. 
That is, Ss changed their responses considerably but 
they did not do so in a consistent manner. Luchins 
(13) in investigating why Ss did not give certain 
kinds of responses in the early part of the Rorschach 
test found that one of the more frequent reasons 
was the S’s fear of being labeled “psycho.” Other 
reasons also implied that the S was responding to 
his interpretation of the purpose of the test. Simi- 
larly, Hutt, Gibby, Milton, and Pottharst (8) have 
shown that variations in instructions will produce 
significant variations in responses. Schachtel (17) 
has discussed in detail how the S’s implicit definition 
of the Rorschach test will affect responses but has 
not investigated these experimentally. 


In the present study, the situational vari- 
able of the S’s definition of the test was ma- 
nipulated and then investigated. It was felt 
that one of the most widespread views of the 
Rorschach, resulting from publicity of mass 
media, is that the Rorschach test is a meas- 
ure of “sanity,” that this test will uncover 
imbalances in a person’s emotional life. In ac- 
cord with the previous discussion, it was hy- 
pothesized that such a view of the test would 
alter an individual’s responses in the direc- 
tion of caution and the giving of only what 
most Ss would consider to be “safe” re- 
sponses. In order to test these assumptions, 
the experimental Ss were given instructions 
about the test which included the suggestion 
that this test is designed to discover emo- 
tional disturbances of a serious nature in 
mental patients. An underlying assumption 
of the study was that, when Ss respond in a 
particular way after being given a hypothesis 
about the test, these responses are compa- 
rable to those of another individual who 
holds the same hypothesis implicitly. The 
following hypotheses were formulated: 
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The experimental group will give signifi- 
cantly; (a) fewer responses in total (R) 
than will the control group; (4) more re- 
sponses scored as form (F) than will the con- 
trol group; (c) more responses of good form- 
level (F +) than will the control groups; (d) 
more large detail (D) responses than will the 
control group; (e) fewer whole (W) re- 
sponses than the control group; (f) more 
popular (P) responses than will the control 
group; (g) more animal (A) responses than 
will the control group; (/#) fewer responses 
of an aggressive and violent nature than will 
the control group. 

In all these hypotheses the assumption is 
made that Ss in the experimental situation 
will give, in the main, those responses which 
they consider to be “safe,” inhibiting those 
which they consider suspect, those which 
would tend to place them in an unfavorable 
light. That is, they will give more responses 
which are conforming, which they expect to 
be similar to what most other people would 
see on the same blot or area. The present 
study differs from other Rorschach studies 
in that a situational variable is manipulated, 
and hypotheses are formulated about the 
changes in the determinants on a logical, 
“common-sense” basis rather than on the ba- 
sis of the usual theoretical meaning of the 
various determinants although the two ap- 
proaches coincided in some instances. The 
hypothesis about decrease in total number of 
responses for the experimental Ss, for in- 
stance, was formulated because of the gen- 
eral tendency toward caution and inhibition. 
Similarly, an increase in pure form responses 
was hypothesized not because of its theoreti- 
cal meaning as a measure of intellectual con- 
trol but rather because form responses are 
often easier to see and those based on color, 
texture, or movement are frequently less clear 
and would be considered suspect. Large de- 
tail responses, again, were believed to corre- 
spond to the more obvious percepts; the pre- 
ponderance of popular responses is in line 
with a “common-sense” attempt at caution 
and safety, at seeing things as most other 
people are likely to see them. The hypothe- 
sized increase in animal responses concurs 
with an increase in popular responses. The 
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assumption that a large number of animal re- 
sponses suggests intellectual immaturity was 
not employed. It was felt that a reduction in 
aggressive and violent types of responses 
would result from attempts by Ss to make 
their responses as innocuous as possible, to 
inhibit any signs of lack of control over 
feelings. 


Procedure 


Subjects. Sixty female college students in 
elementary psychology were randomly al- 
lotted to two groups of thirty. 

Procedure. When S entered the experimen- 
tal room, she was asked the following initial 
questions: 


“Have you ever heard of the Rorschach test .. . 
the inkblot test?” If the response was affirmative, 
the following question was asked: 

“Tell me what you remember about it? . . . What 
do you think it does? What is it for?” 

Responses to these questions were recorded by E 
who then gave the main instructions. For the experi- 
mental group these were: 

“This is a series of cards with inkblots on them. 
People look at the cards and tell what the blots re- 
mind them of. For many years the test has been 
used in mental hospitals to study emotional disturb- 
ances and to discover how seriously disturbed pa- 
tients were. Now we are making a college survey 
with it. 

“Now I want you to look at the blots and tell me 
what they could be, what they look like to you, or 
what they remind you of. People see all different 
things in the blots—there are no right or wrong 
answers.” 


The individuals in the control group, every 
second S, were also asked the questions about 
their previous acquaintance with the test, and 
then were given the standard Klopfer instruc- 
tions: 


“This is a series of cards with inkblots on them. 
I want you to look at them and tell me what they 
could be, what they look like to you, or what they 
remind you of. People see all different things in the 
blots—there are no right or wrong answers.” 


Following this, the Rorschach was adminis- 
tered in the standard manner to all Ss, and a 
fairly thorough inquiry followed. 

After the test, the experimental Ss were 
asked a number of general questions, designed 
to elicit information about the effectiveness 
of the instructions. 


 . 


ae 


Scoring. All identification was removed from 
the records, and the protocols were scored ac- 
cording to Beck’s (1) manual. An independ- 
ent rater then scored twelve records, also 
using Beck’s criteria. A short scoring manual 
was constructed for the presence of content 
involving aggression or violence. The cate- 
gories used were as follows: explosions; fire; 
blood; people or animals battered, beaten, or 
demolished; fighting or killing; monsters in- 
volved in some kind of threatening action. A 
frequency count of these themes was then 
made by the experimenter and by the same 
independent rater.* 

Statistical methods. To test reliability, a 
percentage of agreement was computed be- 
tween the two scorers on twelve records. A 
Pearson product-moment correlation was used 
to test reliability of scoring for aggressive 
content on all sixty protocols. 

To test the hypothesis dealing with total 
number of responses, a ¢ test for differences 
between means was used. For all other hy- 
potheses, two different techniques were used. 
The first technique involved converting all 
scores to percentages and then computing ¢ 
tests for differences between the means of the 
percentages. The second technique was essen- 
tially a method of equalizing the scores to 
account for any differences in productivity 
(Total R) between the two groups.’ 


1The authors would like gratefully to acknowl- 
edge the assistance of Mr. Walter Mischel in this 
respect. 

2 We are grateful to Dr. Robert Wherry who sug- 
gested a method for equalizing the effects of an un- 
equal number of responses in the two groups. To 
equalize scores the following steps were taken for 
each variable except R: (a) The total number of re- 
sponses for both experimental and control groups 
was computed; (b) The total number of responses 
on any particular variable, such as F, were com- 
puted, and this number was divided by the total R 
to give a predicted score on the variable under con- 
sideration for the group as a whole; (c) To arrive 
at an individual predicted score, this total predicted 
score was multiplied by each individual’s total R; 
(d) This predicted score was then subtracted from 
each individual’s actual score on the variable being 
considered. This score served as an indication of 
where each individual’s score fell in relation to the 
score that would be predicted from total group per- 
formance, These scores were then used in the com- 
putation of ¢ tests of significance between the means 
of the two groups. 


Situational Influences on Rorschach Responses 


Results 


Reliability of measures. Percentages of 
agreement were calculated between the scor- 
ing of E and an independent rate on twelve 
protocols. (On the remaining protocols, the 
E’s scoring alone was used with all identifica- 
tion first removed from the records.) Separate 
computations were performed for location, de- 
terminants, and content. The percentage of 
agreement for scoring of location was .95, for 
determinants, .82, and for content, .92. A 
second reliability check concerned the scoring 
for aggressive and violent content, scoring 
having been based on a brief manual con- 
structed by E. Here, all sixty protocols were 
used. Pearson product-moment correlations 
were used here, with separate computations 
for the experimental and control groups. The 
correlation coefficient for the experimental 
group was .83, and for the control group, .66. 

The results of the tests of the experimen- 
tal hypotheses are given in Table 1. 

a. R (total responses) lower in experimen- 
tal than control group: The experimental 
group mean was 16.07, and the control group 
mean was 23.47. A ¢ of 4.11 supports the hy- 
pothesis beyond the .001 level of confidence. 


Table 1 


Sigmas, Means, és and Significance of Differences 
Between Means for the 30 Experimental 
and 30 Control Subjects 


Variable oe Mean, o- Mean, p 
Responses 6.20 16.07 7.52 23.47 4.11 <.001 


From Equalized Scores 
Popular 185 .56 1.97 —.56 2.23 <.05 
Form 243 .76 3.56 —.75 188 <.10 
Form-Level 1.17 .66 249 —.67 2.62 <.02 
Aggressive 1.94 —.48 180 —.12 1.28 <.20 


Animal 2.14 73° 2.86 —.74 2.25 <.05 

Whole 3.48 15 5.04 87 63 

Detail 3.91 23 3.97 22 02 <.90 
From Percentages 

Popular 12 32 04 26 2.29 <.02 

Form 19 61 2.50 <.02 


Form-Level 08 87 80 2.70 <.01 
Aggressive Al 07 10 108 <.20 
Animal 4 53 12 43 293 <01 
Whole 23 45 46 <A44 92 <.40 
Detail a DH <P 
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b. More F (form) responses in experimen- 
tal than control group: Using equalized R 
scores, means of .76 and — .75 were obtained 
for the experimental and control groups, re- 
spectively. A ¢ of 1.88 was obtained, which 
yields a p value between the .05 and .10 lev- 
els and tends to substantiate the hypothesis. 
Using percentage scores, the mean value for 
the experimental group was .72 and .61 for 
the control group. A ¢ of 2.5 supports the 
hypothesis between the .02 and .01 levels of 
confidence. 

c. More F + (good form level) responses 
in experimental than control group: With 
equalized scores, means were .66 and — .67 
for experimental and control groups, respec- 
tively. A ¢ of 2.62 supports the hypothesis 
between the .02 and .01 levels. A ¢ calculated 
from percentages yielded a value of 2.70, sig- 
nificant beyond the .01 level in the predicted 
direction. Mean values were .86 and .80 for 
the experimental group and the control group, 
respectively. 

d. More D (large detail) responses in ex- 
perimental than control group: With equal- 
ized scores, means of .23 and .22 were ob- 
tained for the experimental and control 
groups. A ¢ of .02 and, with percentages, a 
t of .34 failed to give any substantiation to 
this hypothesis. Means of percentage scores 
were, respectively, .50 and .49. 

e. Fewer W (whole) responses in experi- 
mental than control group: Using equalized 
scores, a mean of .15 was obtained for the 
experimental group and of .87 for the con- 
trol group. A ¢ of .63 with equalized scores 
and .92 with percentage scores did not sub- 
stantiate this hypothesis. Means with per- 
centage scores were .45 and .44 respectively. 

f. More P (popular) responses in experi- 
mental than control group: Experimental 
group mean, with equalized scores, was .56; 
control group mean was — .56. A ¢ of 2.23 
with equalized scores lent support to this hy- 
pothesis between the .02 and .05 level. With 
percentages, means were .32 and .26, respec- 
tively. A ¢ of 2.29 supported the hypothesis 
between the .025 and .01 levels. 

g. More A (animal) responses in experi- 
mental than control group: Using equalized 
R scores, means were .72 and — .74, respec- 
tively. A ¢ of 2.25 supports this hypothesis 
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between the .02 and .01 levels. With percent- 
ages, means were .53 and .43 for experimen- 
tal and control groups, respectively. With ¢ 
calculated from percentage scores, signifi- 
cance was between the .01 and .001 levels. 

h. Fewer aggressive responses in experi- 
mental than control group: With equalized 
scores, means were — .48 and — .12, respec- 
tively. A ¢ of 1.28 fell between the .10 and 
.20 levels of confidence. Using percentage 
scores, means were .07 and .10, respectively. 
A ¢ of 1.08 also fell between the .10 and .20 
levels, and, therefore, did not significantly 
substantiate this hypothesis but was in the 
right direction. 

Discussion 

The results of this study, in general, seem 
to substantiate the hypotheses advanced. In 
regard to aggressive responses, the total num- 
ber produced in the group was too few to ob- 
tain significance in the results and the as- 
sumption that the more conforming responses 
were frequently more D than W was not 
borne out. The other hypotheses regarding 
the number of responses, the number of form 
responses, form-level, popular responses, and 
animal responses, were all supported to a 
highly significant degree. 

The real significance of this study does not 
lie in its demonstration of the effect of a dif- 
ferent type of instructions but rather in the 
implications that subjective hypotheses about 
what the test is testing will have a significant 
effect on the S’s responses. As applied to the 
Rorschach test some specific information is 
presented in this study as to what that effect 
may be when the S has the hypothesis or be- 
lief that the test is essentially a test of in- 
sanity, mental disturbance, psychopathology, 
etc. Such a belief is probably fairly commonly 
held as a result of the widespread populariza- 
tion of the Rorschach through magazine ar- 
ticles, Sunday supplements, movies, radio, 
television, and also through direct com- 
munication. The results support strongly 
Schachtel’s assertion that it is extremely im- 
portant for an examiner to find out what 
these implicit beliefs regarding the test are, 
and to evaluate test results in the light of 
them. 
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It is of some interest that the general ques- 
tioning of the Ss following the test adminis- 
tration did not reveal too much in the way of 
awareness that the Ss were trying to appear 
in a good light, not mentally disturbed, etc. 
Some Ss after considerable pushing would 
finally agree that they tried to avoid men- 
tioning anything that might sound peculiar, 
but in general, more prediction could prob- 
ably be obtained by assuming certain effects 
likely to occur as a result of the S’s hypothe- 
ses about the test purposes than could be ob- 
tained by questioning the S regarding his 
motives in taking the test. 

The study also tends to support, at least 
by implication, the general notion that con- 
ceptualizing the S’s behavior (including test 
responses) in a test situation as an attempt 
to reach some goal to give a particular kind 
of impression or to defend himself against 
some threat can be a useful predictive method 
from a common-sense point of view. 

The results also tend to support the now 
growing body of literature which emphasizes 
the need to interpret psychological test re- 
sults in light of a host of situational variables, 
no matter how indirect the test, rather than 
limiting analysis to the test protocols or pro- 
files obtained from several tests or subtests. 


Summary 


This study measured the influence of one 
aspect of the psychological situation, knowl- 
edge of test purpose, on responses to the Ror- 
schach test. The Ss were sixty female stu- 
dents in elementary psychology. Experimental 
Ss were told that the Rorschach is used to 
discover serious emotional disturbances, then 
given standard Klopfer instructions; control 
Ss received only Klopfer instructions. Hy- 
potheses were that experimental Ss would 
give fewer total, aggressive, and whole re- 
sponses; more pure form, good form-level, 
popular, animal, and large detail responses. 
The specific hypotheses all involved the gen- 
eral hypothesis that experimental Ss would 
be cautious and conforming. Results were 
analyzed by two methods for every variable 
except total responses. By one, raw scores 
were converted into percentages, and ¢ tests 


computed. The second involved a technique 
of equalizing scores to eliminate any influence 
of reduced number of total responses in one 
group. With total responses, ¢t’s were com- 
puted on differences between means. Results 
showed fewer responses in experimental group 
(beyond the 1% level), more good form- 
level, popular, and animal responses (be- 
tween 5% and 1% levels), more form re- 
sponses (between 10% and 5% levels), fewer 
aggressive responses (between 20% and 10% 
levels). Location did not vary significantly 
between groups. 


The results tend to support the general hy- 
potheses that: (a) a knowledge of the S’s be- 
liefs regarding the purposes of the test should 
be an important aspect of the interpretation 
of the test, (5) a “common-sense” analysis 
of the S’s approach to the whole testing 
situation would be a fruitful part of the test 
interpretation, and (c) situational determi- 
nants significantly influence test responses. 
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Affective Control and Rorschach Human 
Movement Responses’ 


Julian Meltzoff and Dorothy Litwin’ 
Veterans Administration Regional Office, Brooklyn, New York 


Previous studies have demonstrated that 
one of the important behavioral correlates of 
Rorschach Human Movement Responses (M) 
is the ability to inhibit or delay motor acts 
voluntarily. In support of hypotheses drawn 
from both Werner’s Sensory-Tonic theory 
and Rorschach’s empirical observations, a 
series of experiments (3, 4, 5, 6) has shown 
that Ss who are responsive to the kinesthetic 
aspects of the inkblots are better able to in- 
hibit motor acts than those who are not. Fur- 
ther investigations (1, 2) have extended this 
relationship to the cognitive realm. Experi- 
ments by Meltzoff and Levine have shown 
that good motor inhibition goes hand in hand 
with the ability to inhibit learned associations 
voluntarily and substitute new ones. Further- 
more, effective cognitive inhibition was shown 
to be positively related to the perception of 
movement in the same way as was motor in- 
hibition. Thus M appears to be an indicator 
of the ability to inhibit or control both mo- 
tor acts and thoughts. 

The aim of the present study is to extend 
the relationship of M and inhibitory ability 
one step further to include the control of 
feelings as well. 


The Experiment 


Hypothesis. A direct relationship between 
the production of Rorschach Human Move- 
ment Responses and affective control is hy- 
pothesized. It is predicted that more Ss with 
high M than with low M will be able to in- 


1 Paper read at East. Psychol. Ass., Atlantic City, 
March, 1956. 

2From the Veterans Administration Regional Of- 
fice, Brooklyn, New York. 


hibit the overt expression of an experimen- 
tally induced affect. 

Procedure. The general method consists of 
inducing an affect in the Ss, instructing them 
not to express their feelings, and assessing the 
success or failure of their control. Considera- 
tion of the various methodological and ethi- 
cal problems involved in experimenting with 
strong, unpleasant affects, led to the selection 
of one of the pleasant emotions instead. An 
effort was made to induce an affective state 
by presenting a humorous stimulus. The 
stimulus chosen was “The Jones Laughing 
Record,” a recording by Spike Jones and his 
orchestra for RCA-Victor. This is a recording 
of contagious laughter with an accompani- 
ment of humorous background music and 
assorted noises. 

The judgment as to what constituted an 
affective response was made on the basis of 
fairly gross pre-established criteria. Any re- 
action ranging from the first sign of a smile 
to an outburst of laughter was considered as 
an affective response. This included any re- 
action from a break in the set of the mouth 
with a widening and upturning at the cor- 
ners, to the opening of the mouth accom- 
panied by the characteristic spasmodic in- 
spiration and expiration of laughter. 

In a preliminary trial with two observers 
there was a complete agreement on judgments 
of the mere presence or absence of an affec- 
tive response at some point during the 2}- 
minute recording. A single observer was used 
in the experiment proper. 

The recording was pretested on a sample of 
20 mixed, haphazardly selected college stu- 
dents, 19 of whom responded by smiling or 
laughing. Following the record, Ss were asked 
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to respond in the usual manner to Cards III 
and VII of the Rorschach. The sample was 
then divided at the median into High (2 or 
more) and Low (less than 2) M groups. It 
was determined that there was no significant 
difference between the two groups in the 
number of Ss who responded with an overt 
expression of affect (y? = .12, p = .70). Thus 
the record was successful in inducing an im- 
pulse to laugh, and the expression of that im- 
pulse, regardless of the amount of M pro- 
duced by the Ss. 

Sixty-eight additional Ss from the same 
general population were introduced to the 
same experimental procedure with one cru- 
cial modification. Prior to hearing the record, 
Ss were instructed not to smile or laugh. This 
sample was similarly divided at the median 
into high M and low M groups. The two 
groups were compared on their success or 
failure in controlling the expression of the 
experimentally induced affect. 

It is to be noted that the Rorschach was 
administered following the laugh record in 
order to rule out any bias in the judgment of 
affective response. Thus the experimenter had 
no way of knowing whether a subject was 
going to fall in the high M or low M group 
at the time of making the judgment about 
the presence or absence of an affective re- 
sponse. 

Rorschach protocols were scored independ- 
ently by two judges without knowledge of 
the Ss performance on the inhibition task. In 
the few cases of scoring disagreement the 
judgment of a third psychologist served as 
the deciding factor. 


Results 


Results were in the predicted direction. Of 
the 34 Ss in the high M group, 20 were able 
to listen to the entire record without any ob- 


Table 1 
Affective Control of High and Low M Groups 


Successful Failure to 
Group Control Control 
High M (2 or more) 20 14 
Low M (less than 2) 11 23 


Julian Meltzoff and Dorothy Litwin 


servable display of feeling. Only 11 of the 
34 Ss in the low M group were similarly suc- 
cessful (Table 1). 

Since the direction was hypothesized, a one- 
tailed test of significance was used. The ob- 
tained chi square of 3.79 is significant at the 
.03 level. Analysis of the data in terms of the 
total number oi responses (R) reveals that 
the obtained relationship is not dependent 
upon R (xy*=1.27, p> .20). Tetrachoric 
correlation between M and affective control 
was .42 (p < .01). 


Discussion 

These results support the hypothesis of a 
significant positive relationship between M 
and the voluntary inhibition of expression of 
an experimentally induced impulse to laugh. 
Although these data cannot be generalized to 
the control of other affects, one could hy- 
pothesize that further experimentation with 
other affects would reveal similar relation- 
ships. 

It is of interest to speculate about the proc- 
esses involved in controlling affective expres- 
sion. Is there any fundamental difference in 
process between the inhibition of the expres- 
sion of an affect and the control of the affect 
itself? The two may be equivalent only from 
the point of view of the outside observer. 
Controlling affective expression appears to 
have both motor and cognitive components. 
A person can keep from expressing an im- 
pulse to laugh, for example, by controlling 
certain facial muscles. The affect can remain 
even though its overt expression is controlled 
by motor means. Some of the subjects re- 
ported that they felt very much like laugh- 
ing, but restrained themselves by clenching 
their teeth and compressing their lips. 

The other way of controlling the expression 
of affect is through the modification of the 
affective tone itself by cognitive processes 
which alter the autonomic reaction pattern. 
By way of illustration, one of the subjects 
reported that she kept from laughing by 
thinking of her mother dying. She was thereby 
able to change the affective tone through cog- 
nitive means. The affect need not be changed 
to the extent of substituting an opposite one, 
as thoughts could be used in a variety of 
ways merely to neutralize the existing feel- 


ing. We have already cited experiments which 
have demonstrated that the abilities to inhibit 
motorically and cognitively are each corre- 
lated with M. If voluntary control of the im- 
pulse to express an affect is viewed in terms 
of its motoric and cognitive components, the 
present experiment may simply be reflecting 
these relationships in a more complex setting. 

As suggested above, there may be two basic 
styles of inhibiting affective expression—one 
primarily motoric, and the other principally 
cognitive. We might speculate further and in- 
voke a concept of motor-cognitive vicarious- 
ness. This would point up the possibility that 
motor inhibition tends to lead to the produc- 
tion of thoughts or fantasies which in turn 
serve to modify the affective tone. Observa- 
tions and reports from the Ss suggest that 
their initial adaptation to the task was 
through motor control. It appeared that this 
was later supplemented by cognitive meth- 
ods at different stages according to the indi- 
vidual. This progression from motor to cog- 
nitive control can be thought of as a re- 
capitulation of the developmental process, in 
that the ability to use cognitive processes to 
control feelings might be expected to appear 
at a later stage of personality development. 
Observations of young children suggest that 
control largely through motor means is 
gradually supplemented by the acquisition of 
a variety of cognitive techniques which can 
modify the affective state. When these mecha- 
nisms are refined in the normal adult, it is 
more difficult for the observer to detect the 
affective state than it is in the emotionally 
labile child. We would expect different be- 
havorial and symptomatic manifestations in 
adults with these two different styles of con- 
trol in accordance with the differences in 
their chronic reaction pattern. Thus we might 
anticipate different symptom complexes in in- 
dividuals who are chronically exerting motor 
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control in contrast to others whose control is 
exercised largely through thinking. It would 
therefore be important to contrast individuals 
who are found to utilizé predominantly one or 
the other of these methods from the point of 
view of relating their style of control to devel- 
opmental levels and to the complicated prob- 
lem of symptom choice in psychopathology. 


Summary 


Previous experiments have demonstrated 
motor and cognitive inhibition to be behav- 
ioral correlates of Rorschach Human Move- 
ment responses. The present study extends 
the relationship of M and inhibition ability 
further to include the control of affective im- 
pulses as well. An impulse to laugh was in- 
duced, and Ss instructed to refrain from any 
affective expression. In support of the hy- 
pothesis, more high M than low M subjects 
were successful at inhibiting any overt ex- 
pression of the experimentally induced affect. 


Received February 21, 1956. 
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The Use of the Kaleidoscope as an Adjunct 


to the Rorschach 


Emanuel Starer 
VA Hospital, Northport, N. Y. 


One of the problems encountered in the 
psychological testing of many chronic psy- 
chotic patients is that they frequently pro- 
duce only ten or fewer responses on the Ror- 
schach. As a result, the examiner is hampered 
in his attempt to describe the underlying per- 
sonality structure in detail. 

The writer has employed the kaleidoscope 
with 20 chronic patients, with the instruc- 
tions to report whatever was seen. After the 
patient had finished associating to a given 
stimulus, he was instructed to shift the ka- 
leidoscope which would automatically intro- 
duce another stimulus. In this manner an 
unusually large number of stimuli could be 
introduced in a relatively short space of time. 
Approximately 15 to 20 minutes were allowed 
for each patient. The associations to the ka- 
leidoscope could be analyzed in a manner 
similar to content analysis on the Rorschach. 

Most of the patients produced two to three 
times the number of responses obtained on 
the Rorschach. Indeed, one patient who re- 
fused the Rorschach volunteered to associate 
to the kaleidoscopic stimuli and produced 30 
responses which were helpful in evaluating 
some of his areas of conflict. 


Examples of some of the responses obtained with 
the kaleidoscope include those frequently obtained 
from the Rorschach such as: bugs, butterflies, bats, 
insects, plants, flowers, and designs, usually geometric 
in nature. 


Responses containing a color component included: 
a bloody rag, two big red blotches, face of a cat 
with red eyes and red nose, and I see a discolored 
ball, sort of pale and brown. 

Some of the responses containing human, animal, 
or inanimate movement included: man’s face, sort 
of stupid looking with sharp nose, statue of the 
Lord, He’s bowing His head, face of a cat who is 
ready to eat you up, crabs clawing an insect, a 
spider making up his mind whether to poke at me 
or not, three bolts of lightning hitting an object 
from three different directions. 

Finally, some examples of highly personalized as- 
sociations included: pair of ice tongs like the ones I 
used when a boy, type of Pennsylvania-Dutch de- 
sign, a hex sign, insignia of the devil, fundus of the 
uterus, mop of hair of a child, pancake hat for 
young lady, and motor neurone. 


The kaleidoscope technique is recommended 
only when there is a paucity of associations 
on the Rorschach, and so can be considered 
only an adjunct. Much valuable information 
is lost since only a content analysis can be 
made. Perhaps the most objectionable to 
many investigators is that the stimuli are not 
constant and so can not be reduplicated. In 
spite of these and other objections, this ex- 
aminer has found the adjunctive use of the 
kaleidoscope in personality evaluation to be 
worthy of further investigation and use in 
practical clinical testing situations, particu- 
larly with chronic psychotic patients. 


Received February 29, 1956. 
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The Use of Human Figure Drawings in the 
Diagnosis of Organic Pathology 


Marvin Reznikoff and Donald Tomblen 
The Institute of Living 


Clinical psychologists are often asked to 
aid in the differential diagnosis of organic 
pathology. One of the widely used techniques 
in psychodiagnostics is the human figure- 
drawing test. Experience has suggested that 
this relatively unstandardized method may 
prove to be useful in attempting to evaluate 
the presence or absence of organic difficulties, 
but there are relatively few published studies 
in this area. Michal-Smith (2) reported the 
results of the H-T-P technique, comparing 
boys whose EEGs were abnormal with boys 
who were found to be normal with reference 
to their EEGs. Only one sign yielded a sig- 
nificant difference, namely, “line quality.” 

Machover (1) and Vernier (3) both listed 
indicators which they mentioned as “often 
seen” in the organic case. For Machover, 
these included: disproportionately large head; 
empty, large, poorly proportioned figures; 
weakly synthesized drawings; and simple, 
heavy line quality. Vernier suggested “petal” 
or scribbled fingers; center-of-page place- 
ment; shrunken arms and legs; erasures in 
arm and hand area; and eyes drawn as slits. 
In agreement with Machover, she also listed 
largeness of head and whole figure; empti- 
ness, or lack of detail; weak synthesis; and 
poor proportions. However, she described the 
line quality as sketchy and/or broken, rather 
than heavy and simple. 

The present study was undertaken to in- 
vestigate: (a) the degree to which these vari- 
ous suggested indicators are actually more 
characteristic of the drawings of organic pa- 
tients than they are of the drawings of other 
psychiatric groups, and (6) the extent to 
which these indicators may reliably be em- 
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ployed in the differential diagnosis of or- 
ganicity. 


Procedure 


Sample. Three groups of 25 patients each 
were selected from the files on the basis of 
final diagnoses, which in this institution fol- 
low the criteria and classifications described 
in the Diagnostic and Statistical Manual of 
Mental Disorders prepared by the American 
Psychiatric Association. In one group, all pa- 
tients had been given organic diagnoses, spe- 
cifically “chronic brain syndrome.” Another 
group of patients were diagnosed as schizo- 
phrenic, and the last group as having neurotic 
disorders. Organic findings were entirely ab- 
sent in the schizophrenic and neurotic groups. 
All patients were at least average in intelli- 
gence and between 19 and 55 years of age, 
with a mean age of 35.4 years. All patients 
had drawn human figures during routine psy- 
chological testing. The patients had not been 
tested by either author, nor was any informa- 
tion about the personal history, diagnosis, or 
results of other psychological tests available 
to the authors when they scored the drawings. 

Scoring method. A list of 17 indicators was 
compiled, based upon those suggested in the 
literature plus several others derived from the 
personal experience of the senior author. 
Scoring criteria for each indicator were de- 
lineated. In the instance of No. 15, “En- 
larged figure size,” actual measurement of 
the drawing length was recorded. For all 
other indicators, a simple plus-minus system 
was used. If an indicator was clearly present, 
it was scored plus; otherwise, it received a 
minus score. The indicators were: 
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Table 1 


Analysis of Variance of Drawing Size Measurements of 
Organic, Schizophrenic, and Neurotic Patients 


Source of Mean 
variation df square F 
Between groups 2 6.24 1.15 
Within groups 72 5.44 
Total 74 


1. Emptiness of figure (lack of details) 

2. Weak synthesis (parts loosely joined) 

3. Parts misplaced (noticeably shifted) 

4. Disproportionately large head 

5. Shrunken arms and legs (thinner and shorter) 

6. Other parts distorted (inappropriate size and/ 
or shape) 

7. “Petal” or scribbled fingers 

8. Eyes drawn as slits 

9. Center-of-page placement 

0. Erasures in arm and hand area 

1. Erasures elsewhere (two or more in single 
drawing) 

12. Heavy, simple line quality 

13. Sketchy line quality 

14. Broken line quality 

15. Enlarged figure size 

16. Unlifelike pose 

17. Erasures with only minimal improvement 


The drawings of all 75 subjects were ran- 
domly arranged with all identifying data re- 
moved. Each of the authors independently 
scored every drawing for every indicator. 


Table 2 


Drawing Indicators Showing Significant Differences 
Between Groups of Organic and Neurotic 
Patients of Twenty-five Each 


Frequency of 
occurrence 
Or- Neu- Chi 
Indicator ganic otic square p 
Weak synthesis 15 1 15.53 001 
Parts misplaced 15 1 15.53 001 
Shrunken arms and 
legs 14 2 11.12 001 
Parts other than head 
and extremities dis- 
torted 15 3 10.50 O01 
Petal or scribbled 
fingers 13 4 5.70 .02 
Emptiness of figures 15 5 6.75 O01 


Marvin Reznikoff and Donald Tombien 


Two indicators proved to be unscorable in 
practice and therefore were eliminated. These 
were: No. 16, “Unlifelike pose,” and No. 17, 
“Erasures with only minimal improvement,” 
both original with the authors. It was impos- 
sible to arrive at a satisfactory, objective 
definition of unlifelike pose, and it was like- 
wise extremely difficult to determine the de- 
gree of improvement after erasures without 
actually observing the drawing process itself. 
On the 15 retained indicators scored for every 
drawing, there was 91 per cent interscorer 
agreement. Minor differences of scoring were 
readily resolved in conference, still independ- 
ent of any knowledge of the subject’s diag- 
nostic category. 

Treatment of data. Indicator No. 15, “En- 
larged figure size,” was tested separately by 
means of an analysis of variance of figure 
measurements. All other indicators (Nos. 1-— 
14) were studied together. A tabulation was 
made of the frequency of occurrence of each 
indicator among the 25 patients in each diag- 
nostic group. Maximum frequency would thus 
be 25. Each group was then compared with 
the other two groups by means of chi square. 

Analysis of variance was again utilized to 
test whether or not the patients in the or- 
ganic group made scores significantly higher 
than patients in the other groups. For this 
particular analysis, the contradictory hy- 
potheses concerning line quality were elimi- 
nated, and a score was obtained for each pa- 
tient consisting of the number of indicators 
present out of a possible total of 11 (Nos. 1- 
11). These scores were converted to percent- 
ages, and an angular transformation was done 
in order to stabilize the variance. The analy- 
sis-of-variance design was then applied to the 
transformed scores of the 75 patients dis- 
tributed into the three diagnostic groups. 


Results and Discussion 


From the results of the analysis of variance 
cited in Table 1, which yielded nonsignificant 
results, it is evident that the organic patient 
does not characteristically draw larger figures 
than other patients. 

Tables 2 and 3 show that 5 of the 14 indi- 
cators are indeed more characteristic of or- 
ganic drawings than of drawings of either 
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schizophrenics or neurotics. An additional in- 
dicator, emptiness of figures, showed a sig- 
nificant difference between organic and neu- 
rotic productions, but not between organic 
and schizophrenic drawings. The remaining 
indicators occurred with approximately equal 
frequency in all groups, with any discrep- 
ancies attributable to chance factors. When 
schizophrenics were compared with neurotics, 
none of the indicators differentiated between 
the groups to a statistically significant degree. 

It may be noted here that Vernier’s sketchy 
or broken line indicator was present more 
often in the organic group than was Mach- 
over’s simple, heavy line. However, neither of 


Table 3 


Drawing Indicators Showing Significant Differences 
Between Groups of Organic and Schizophrenic 
Patients of Twenty-five Each 


Frequency of 
occurrence 


Or-  Schizo- Chi 


Indicator ganic phrenic square p 
Weak synthesis 15 5 6.75 O1 
Parts misplaced 15 5 6.75 01 
Shrunken arms and 

legs 14 0 16.77 001 
Parts other than head 

and extremities dis- 

torted 15 6 5.26 OS 
Petal or scribbled 

fingers 13 1 12.00 001 


these indicators appeared to a greater extent 
in the organic group than they did in the 
other groups. 

Results of a final analysis are reported in 
Table 4 and indicate the highly significant 
degree to which organic patients surpass both 
schizophrenics and neurotics in the number 
of indicators present in their individual draw- 
ings. 

Since the analysis-of-variance design ap- 
plied to the transformed scores demonstrated 
a disparity among the three groups signifi- 
cant at the .001 level, the groups were next 
considered two at a time. The ¢ test was ap- 
plied using the mean square value of the 
within-group source of variation (error term) 
as the pooled variance. Results of the com- 


Table 4 


Analysis of Variance of Transformed Drawing Scores of 
Organic, Schizophrenic, and Neurotic Patients 


Source of Mean 
variation df square F 
Between groups 2 1612.19 13.65* 
Within groups 72 118.15 
Total 74 


* Significant at the .001 level of probability. 


parisons are shown in Table 5. It can be seen 
that the scores of the organic patients dif- 
fered from both those of the neurotics and 
schizophrenics at the .001 level of confidence, 
while the difference between scores of the 
schizophrenic and neurotic patients was not 
statistically significant. 

Although these indicators distinguish be- 
tween groups, it remains to be determined 
how effectively they can be utilized in indi- 
vidual differential diagnosis. Table 6 reveals 
that as many as 7 out of 11 indicators must 
be present in order to establish with certainty 
that the correct diagnosis is an organic one, 
rather than neurosis or schizophrenia. With 
such stringent requirements, one would miss 
nearly three-quarters of all organic cases! 
Reducing the number of indicators to 5 
would still result in 40 per cent false nega- 
tives and also in obtaining 4 per cent and 16 
per cent false positives from among the neu- 
rotic and schizophrenic groups, respectively. 
Since these predictions are based on the same 
cases as were used for the analysis of the 
indicators, it is likely that a greater number 
of false diagnoses would occur on cross vali- 
dation with a new sample. The evidence 
clearly points to the limitations of a naive 
reliance on a purely “sign” approach to the 


Table 5 


Separate Group Comparisons of Drawing Scores of 
Organic, Schizophrenic, and Neurotic Groups 


Comparison t p 
Organics with neurotics 4.66 001 
Organics with schizophrenics 4.16 001 
Neurotics with schizophrenics 0.49 not 

significant 
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Table 6 
Cumulative Distribution of Drawing Scores of Organic, Schizophrenic, and Neurotic Patients 


Organics Schizophrenics Neurotics 
Drawing Numberof %of Numberof % of Numberof % of 
score* patients patients patients patients patients patients 
1 or more 24 96 24 96 24 96 
2 or more 24 96 20 80 19 76 
3 or more 21 84 15 60 14 56 
4 or more 19 76 7 28 6 24 
5 or more 15 60 4 16 1 4 
6 or more 12 48 1 4 1 4 
7 or more 7 28 0 0 0 0 
8 or more 4 16 0 0 0 0 
9 or more 1 4 0 0 0 0 


* Drawing score equals number of indicators recorded as present out of maximum of eleven. 


diagnostic task. Nevertheless, the sophisti- 
cated clinician may well utilize the more dis- 
criminating indicators as a basis for further 
investigating the possibility of organic pa- 
thology in a patient. 


Summary 


This study attempted to evaluate a num- 
ber of drawing indicators cited in the litera- 
ture as characteristic of organic patients. It 
was found that five of these indicators were 
more prevalent among a group of organic pa- 
tients than among schizophrenic or neurotic 
groups. These were: weak synthesis; parts 
misplaced; shrunken arms and legs; parts, 
other than head and extremities, distorted; 
and petal or scribbled fingers. The remaining 
ten indicators did not differentiate between 
the three groups. 


When the number of indicators present was 
scored for the patient’s individual drawings it 
was found that the organics had significantly 
higher scores, but the groups overlapped to 
a considerable degree, limiting the usefulness 
of this approach alone for individual differ- 
ential diagnosis. 


Received February 24, 1956. 
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Drawings of the human figure have been 
fairly widely used and studied as a psycho- 
logical diagnostic and personality assessment 
technique. They have been used as bases for 
inferences about intellectual or maturity level, 
conflict areas, and self concept or “body im- 
age.” The present study is an investigation of 
the extent to which figure drawings reflect 
self concepts or body images which may be 
assumed to be characteristic of disparate lev- 
els of age and social status. The specific ques- 
tion under consideration is whether figure 
drawing is a technique which is sensitive to 
changes in body image occurring from child- 
hood to old age. 

That changes in body image might be an- 
ticipated in individuals who experience the 
physiological vicissitudes associated with ag- 
ing is suggested by Schilder; “When we build 
up a persistent and coherent image of the 
body we build it up out of our emotional mood 
which is based upon biological tendencies . . . 
the changes in the biological unit reflect them- 
selves in the more plastic image of the body” 
(4). 

Several studies have found that the draw- 
ings of institutionalized aged persons differ 
from the drawings of certain groups of nor- 
mal adults. According to one recent study, as 
compared with the productions of university 
graduate students, those of the aged were 
characterized by incompleteness, lack of pro- 


1 Thanks are due Drs. Marion Wieman and Alan 
K. Rosenwald of the University of Illinois, College 
of Medicine for their advice and cooperation in this 
study. 


portionality, and lack of integration (2). 
Machover (3) has observed that senile indi- 
viduals draw small, puny figures, use rela- 
tively little area of the paper for the draw- 
ing and center the figure poorly upon it. 

These observations are interesting in view 
of current theory with respect to the tech- 
nique. In clinical work, gross departures from 
“normative” performances in terms of such 
formal variables as area, size, and centered- 
ness are frequently interpreted as suggestive 
evidence of deviant intrapersonal states. Area 
utilized, for example, is considered to reflect 
self-assessment in relation to the environment. 
Height is similarly interpreted as an indica- 
tion of feelings of relative self-significance. 
Centeredness is often considered to be affected 
by the subject’s general security level and by 
his orientation toward reality. 

Gross differences in self conceptualization 
and, presumably, in body image would be an- 
ticipated in a comparative study of the insti- 
tutionalized aged and normal children close 
to the threshold of adolescence. In one way 
their life situations are somewhat similar: 
both groups are explicitly dependent upon 
protecting and nurturant environmental forces 
for gratification of primary needs. However, 
as contrasted with the vigor, growth, increas- 
ing self-awareness, and recognition of poten- 
tial among the children, the psychological dec- 
rements which accompany the physiological 
changes in the aged are a matter of popular 
concern. Loss of self-esteem, lessened vitality, 
and the sense of impending further decline 
seem to underlie the frequently observed ex- 
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Table 1 
Identifying Data for Subjects 
Education N 
(yrs. of — 
formal Fe- 
Mean Age schooling) Males males 
Institutionalized 
aged subjects 78.5 Fr 13 il 
(N = 24) 
Normalchildren 8.7 3 10 15 
(N = 25) 


* Estimated because of discrepancies in education of aged 
who were all foreign born. 


pressions of futility and hopelessness among 
the institutionalized aged. The question is 
whether these differences are reflected in the 
figure drawings of the two groups. 

Such formal variables as area, height, and 
relative centeredness would appear to be ex- 
cellent points of comparison between the two 
contrasting groups. They are elements which 
are relatively free from the influences of previ- 
ous education and artistic training. They are 
much less subject to these influences and to 
conscious control than such features as inte- 
gration of the figure, facial expression, and 
proportionality. Also, with respect to these 
latter elements, the influence of peripheral, 
sensorimotor impairments would have to be 
taken into account. In the present study of 
formal characteristics peripheral impairment 
would seem to be of minimal significance. 
If the formal characteristics, area, height, 
and centeredness do reflect self-conceptual or 
body-image dispositions, then it would be hy- 
pothesized that differential performance in 
terms of them would distinguish between the 
drawings of institutionalized aged and those 
of normal children. 


Method 


Twenty-five third grade children from a 
public school and twenty-four elderly resi- 
dents of a home for the aged were the sub- 
jects. All were administered the Figure Draw- 
ing Test (3) and were provided with similar 
materials, medium soft pencils and plain 84 
X 11 sheets of paper. The aged subjects were 


given the test in individual examination ses- 
sions. The children were tested as a group. 
Instructions were limited to the simple direc- 
tions: “Draw a person; you may make it any 
way you wish, but draw a whole person, not 
just the face.” 

Table 1 describes the personal character- 
istics of the groups. The children were a typi- 
cal class of third graders. Those markedly 
high or low in previously administered Stan- 
ford Binet scores (above IQ 110 or below IQ 
90) were not included in the present sample. 
The institutionalized aged were also a fairly 
“typical” group, in terms of the home’s stand- 
ards. All were ambulatory and in “good” con- 
tact. Markedly deteriorated individuals and 
those considered to be grossly impaired intel- 
lectually as evaluated on the basis of the 
Bender Gestalt Test and psychiatric inter- 
view, were not included in the study. 

Comparison of the aforementioned formal 
characteristics of the drawings were under- 
taken as follows: 


Area. The area of each drawing was precisely 
measured by means of a geographer’s planimeter. The 
instrument yields results in terms of square inches. 
The areas of the two groups, treated as scores, were 
then compared. Because of the somewhat skewed 
distribution of the data the nonparametric median 
test was employed. 

Figure height. The distance from figure top to fig- 
ure bottom was measured with an ordinary ruler. 
Heights to the nearest inch were treated as scores. 
The scores of the groups were then compared by 
means of the median test. 

Centeredness. A grid of concentric circles, + inch 
apart, was constructed. Each circle was numbered 
and regarded as a unit. By placing the grid-template 
over the sheet with the drawing, it was possible to 
measure the distance from the center of sheet to the 
center of gravity (the midpoint between the figure’s 
extremities) of the figure. Measured distances from 
the center in units were termed scores and were com- 
pared by means of the median test. 


Results 


Table 2 shows the differences in the per- 
formances of the two groups of subjects. 

On the measure of area the group of insti- 
tutionalized aged and that of normal children 
had medians of 3.0 square inches and 6.8 
square inches respectively. On the second 
measure, that of distance from the center in 
4 inch units, the medians were 11.0 and 6.0. 


Human Figure Drawings by Aged and by Children 


Medians of the group with respect to the 
measure of height were 4.0 inches and 6.0 
inches respectively. 

To provide some indication of the vari- 
ability the number exceeding the combined 
median is given for both groups. For the 
measure of area the numbers are 6 and 18 
respectively. For distance from center they 
are 17 and 8, and for height they are 6 and 
19. 


Discussion and Conclusions 


The results confirm the hypothesis that 
formal characteristics of the human figure 
drawings differentiate two groups widely dif- 
fering in age and social status, and assumed 
to be widely differing in self concept and 
body image. 

The direction of the difference, the more 
constricted, shorter, and less adequately cen- 
tered productions of the institutionalized 
aged, as compared with those of the children 
is congruent with the assumed differences in 
self-attitudes among these groups. What is 
significant, in terms of the present study, is 
that the feelings of self-devaluation and a 
“shrinking” body image should find graphic 
expression in the execution of formal features 
of the drawings. On the other hand, the pos- 
sibility that certain organic factors in the 
aged result in constrictions of the figure is 
counterindicated by nonconstricted Bender- 
Gestalt drawings among this group. 

The findings seem to give additional evi- 
dence of the validity of the assumption that 
formal aspects of figure drawings are related 
to the central variables of self conceptualiza- 
tion and body image. In light of the clinical 
assumption about departures from “norma- 
tive” performance it is interesting to note 
that the children seem to perform more simi- 
larly to normal adults than do the aged sub- 
jects. Levy (1) reported an average figure 
drawing height of 7.0 inches for normal 
adults. The median figures for the aged and 
child subjects of the present study are 4.0 
inches and 6.0 inches respectively. 

The findings suggest the application of 
similar methodology in further investigation 
in this area. Comparison of drawings by in- 
stitutionalized aged and by noninstitutional- 
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Table 2 


Differences in Area Size, Height, and Distance from 
Center of Figure Drawings of Institutionalized 
Aged Persons and Normal Children 


Group 
Institu- 
tionalized Normal Median 
aged _ children test 
Area (sq. inches) 
Median 3.0 68 
Number exceeding 
combined median 6 18 10.8** 
Distance from 
center (}” units) 
Median 11.0 6.0 
Number exceeding 
combined median 17 & 7.4* 
Height (to nearest 
inch) 
Median 4.0 6.0 
Number exceeding 
combined median 6 19 


* Significant at the .01 level. 
** Significant at the .001 level. 


ized aged persons might be undertaken to 
study the question of whether changes in self 
concept and body image are due to psycho- 
logical processes attendant upon aging per se 
or whether they are related to such factors as 
relative independence of functioning and the 
effects of permanent custodial-protective care. 


Summary 


On the basis of reported observations and 
theory regarding human figure drawings and 
their relationships with self concept and body 
image it was considered that such formal 
characteristics of the drawings as area, height, 
and centeredness would reflect aspects of 
change in self concept and body image. The 
relative freedom of these variables from back- 
ground influences and from the influences of 
peripheral impairments was suggested. It was 
hypothesized that differences in self concep- 
tualization between institutionalized aged and 
preadolescent children would be reflected in 
differences in these formal aspects of the 
drawings. The drawings of 24 institutional- 
ized elderly persons were compared with those 
of 25 third-grade school children. The re- 


rf 


474 Martin Lakin 


sults confirmed the hypothesis. More con- 
stricted, shorter, and less adequately centered 
figures were produced by the aged subjects. 
The findings were viewed as support for the 
validity of the view that the central variables 
of self concept and body image are reflected 
in drawings of the human figure. The applica- 
tion of this method in further research in this 
area was suggested. 


Received July 11, 1956. 
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Cue-Counting: A Measure of Anxiety in Interviews 


Andrew S. Dibner 


Veterans Administration Hospital, Dearborn, Michigan 1 


Previous studies relating formal aspects of 
speech to anxiety (2, 3, 4, 5, 6) almost en- 
tirely concern themselves with the reflection 
in speech of anxiety as a more or less per- 
manent concept of personality, i.e., the anx- 
iety of the “anxious” or “disturbed” indi- 
vidual. There have been no speech measures 
reported which were developed to record anx- 
iety as a transient, situational phenomenon, 
although transient anxiety is a relevant issue 
when one studies the psychotherapeutic proc- 
ess. Such speech measures were developed by 
the author in a study relating patients’ anx- 
iety in an interview to the ambiguity of the 
interviewer’s behavior (1). It was desired to 
have speech indices as well as other indices 
of anxiety. The indices developed standard- 
ize the measurement of clinical “cues” of anx- 
iety appearing in speech. 


Method 
Subjects 


Subjects were 39 patients hospitalized on 
the NP Service of a VA hospital. Diagnoses 
were all of psychoneurosis, character disorder, 
or psychosomatic reaction. Subjects ranged in 
age from 25 to 54, had a mean IQ of 105 
(range 85-115) and mean education at the 
10.5 grade level. 


Speech Material 


Transcriptions of tape recordings of initial 
psychological interviews with the subjects 
provided the speech material from which the 
measures were developed. Verbatim typing of 
the transcripts was done by trained typists 
following instructions which standardized in- 
dications of pauses, sighing, laughing, and 
changes in loudness of voice. 


1 Now at Clark University, Worcester, Mass. 


Procedure 


Careful study of transcripts of 8 interviews 
gathered in a pilot study revealed 11 speech 
characteristics which we believed were scor- 
able and related to situational anxiety. These 
are listed and illustrated below. 


UN—Unfinished Sentence. The thought is inter- 
rupted and never completed; another thought might 
be expressed, the old thought approached in a radi- 
cally different way, or silence might occur. Ex- 
amples: 

1. “I went to live with my Dad and I had... 
(UN) I lived in (city) with him until I was 16.” 

2. “Oh, she uh . . she’s a (R) .. (UN) I mean 
she always stayed at home.” 

3. “Well, that’s the main .. . (silence) (UN).” 

BR—Breaking in with a New Thought, generally 
by breaking into another sentence. Also if a new 
sentence introduces a radically different line of 
thought and there is no sign of a preceding pause. 
Examples: 

1. “When I was young my mother... (UN) I 
was (BR) .. uh . . ashamed of myself, very much 
ashamed.” 

2. “And when I had this here uh . . depression . . 
(INT) uh, I know what I’m saying right now (BR) 

. . uh, I had quite a jag.” 

INT—Interrupted Sentence. The thought is inter- 
rupted within the sentence but is eventually com- 
pleted in much the same manner as it was begun. 
Examples: 

1. (See example 2 under BR above). 

2. “I was going .. (INT) see, nobody knew it, 
but I decided I was going home.” 

R—Repeating Words or Phrases. When full words 
or phrases (either parts or full) are repeated close 
in sequence within a sentence. Also included are 
modifications of words or phrases. Examples: 

1. “I don’t have . . I don’t have (R) as much gall 
as some people.” 

2. “I worked for two years in a . . an (R) office 
when I was 16 . . no, 17 (R).” 

ST—Stuttering. When repeating parts of words or 
when leaving a word unfinished. Examples: 

1. “I ne . . never (ST) thought of it like that.” 

2. “He even sta... (ST) he even stayed (R) 
away from me.” 
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IDK—I Don’t Know. When the patient says “I 
don’t know,” not in answer to a direct question, but 
as in resignation, disgust, or despair. Examples: 

1. “So that’s the way it happened. I don’t know 
. . . (IDK) things were all mixed up.” 

2. “. .. and the pain has been there since then. 
(Pause) I don’t know (IDK).” 

SI—Sighing or Deep Breath. 

L—Laughing. Any kind of laugh or chuckle. 

V—Voice Change. Change in volume, becoming 
louder or lower. 

?—Questioning the Interviewer. When the patient 
asks the interviewer to repeat what he said, not for 
elaboration but because he was not heard or under- 
stood. Examples: 

1. P: “What?” (?) 

2. P: “Pardon me?” (?) 

BL—Blocking. When there is groping for the 
proper expression, indicating unusual hesitation. Ex- 


amples: 

1. “Soshe .. uh .. well... uh.. (BL) toldme 
I shouldn’t come back.” 

2. “So I..so I (R) .. (snaps fingers) (BL) I 


(R) told her I wouldn’t anyway.” 


To estimate reliability of scoring these cues, 
two raters trained together by reviewing and 
discussing the scoring of eight 10-minute sam- 
ples of interviews. Then they separately scored 
cues on 10 interviews and their percentage of 
agreement on each cue was computed by di- 
viding the number of agreements by the total 
of agreements plus disagreements. Agreement 
means both raters note the same cue in the 
same spot on a line of speech. Disagreement 
occurs when either rater notes a cue and the 
other calls it some other cue or leaves that 
spot unmarked. 


Table 1 
Percentage of Agreement of Two Raters on Each Cue 


Agree/agree 

plus % 
Cue disagree Agreement 
UN 82/134 61 
BR 5/26 19 
INT 24/72 33 
R 143/213 66 
ST 16/17 94 
IDK 9/19 47 
SI 14/14 100 
L 22/22 100 
Vv 41/41 100 
? 4/4 100 


BL 2/2 100 


Andrew S. Dibner 


Ten-minute samples of the remaining 29 
typescripts of interviews were then cue- 
counted and a Cue Score, expressed as a per- 
centage, was computed for each interview by 
dividing the total number of times the cue 
was scored in the interview by the total num- 
ber of typed lines of patient speech. To ana- 
lyze the interrelationships of the cues, a con- 
tingency coefficient between each distribution 
of cue scores was then computed. The in- 
tercorrelations indicated the groups of cues 
which we called Cue Scale I and Cue Scale II. 

Group cue scores for Cue Scale I and Cue 
Scale II were obtained by converting the cue 
score distributions over all interviews to per- 
centile scores, then for each interview adding 
the percentile cue scores within each scale. 
Because of its negative correlation with V and 
SI, the L distribution was made negative be- 
fore combining it with them. 


Results 


Table 1 presents the raters’ agreement on 
scoring each cue. The perfect rater agreement 
on SI, L, V, ?, and BL results from the fact 
that they are clearly marked on the type- 
script by the transcribing secretary. UN, R, 
and especially ST have respectable per cent 
agreements, but it was obvious that if BR, 
INT, and IDK were to be used, they would 
have to be grouped with the more reliable 
cues. 

Table 2 presents the intercorrelations of 
the cue scores. The two cues ? and BL were 


Table 2 


Intercorrelations of Cues 
Contingency Coefficients Corrected for Class Index 
(39 Interviews) 


Cue BRINT R STIDK SI L_ V 
UN 42 .20 -.03 20 56 -13 
ii 
DK — — — — — -B -23 


Cue-Counting: A Measure of Anxiety in Interviews 


omitted from consideration because they oc- 
curred rarely in the data and provided dis- 
tributions with very little spread. 

By inspection of Table 2, the cues were 
grouped into patterns containing the high- 
est possible intercorrelations. The following 
groupings were prominent: BR-UN-IDK- 
INT-R-ST and L-V-SI. These groups we call 
respectively Cue Scales I and II. With the 
exception of IDK, the cues of Scale I are all 
indicators of speech disruption. Scale II is 
composed of cues related to nonverbal voice 
characteristics. The reliability of rating of the 
cues grouped in Scale I is 65 per cent and of 
those grouped in Scale II is 100 per cent. Cue 
Scales I and II have a Kendall’s rank corre- 
lation (Tau) of — .03 (p= .81). They are 
unrelated and used independently as anxiety 
measures. 


Discussion 


In the author’s study referred to earlier 
(1), the Cue Scales and three other indices 
of anxiety (skin conductance curves, clini- 
cians’ global ratings, and patients’ self-re- 
ports) were used to measure patient anxiety 
in reaction to structured or ambiguous inter- 
viewing technique. The Cue Scales showed 
significant relationships to degree of am- 
biguity, just as did the other measures. The 
anxiety measures showed no significant cor- 
relation with each other, except for the corre- 
lations (.32 and .26) of the two Cue Scales 
with anxiety ratings of transcripts by clinical 
judges. This correlation indicates only that 
both sets of speech cues (uncorrelated with 
each other) enter into a judge’s estimate of 
anxiety when reading transcripts. An analy- 
sis of the findings of the study supports the 
hypothesis that anxiety is not a unidimen- 
sional phenomenon but can be evidenced by 
activation of various physiological or behav- 
ioral systems in different persons. Thus a 
person who shows speech disruption in a situ- 
ation of uncertainty need not show concomit- 
ant GSR changes. Is GSR a better indicator 
of anxiety than speech disruption? At this 
point it seems safe to say only that they are 
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both methods by which we can identify anx- 
iety in order to learn more about it. The 
speech disturbances measured by the Cue 
Scales are clearly related to situational anx- 
iety and not to anxiety as a personality trait, 
for the patient groups showed significant 
speech differences in reaction to the varied 
interviewer technique although they were ran- 
domly selected and equated for diagnosis and 
several other pertinent variables. 

The Speech Scales represent speech vari- 
ables indicative of “emotionally involved” 
and disorganized verbal behavior. Some of 
the variables seem to be direct instances of 
disorganization (e.g., incomplete sentence or 
breaking in of new thoughts) and others as 
defenses against or attempts to control disor- 
ganization (e.g., repetition or modification of 
words and phrases). The lack of correlation 
(Tau = .01 and .07) of the two scales with 
the patient’s own report of anxiety leads us 
to feel that the cues of disorganization, in- 
volvement, and defense, measured by the 
scale, are anxiety indicators not subject to 
the person’s awareness. Further research 
should be done to explore other speech cues 
of anxiety, to evaluate the consistency of ap- 
pearance of the cues under varying conditions 
of anxiety, and to investigate the relationship 
of the cues to individual speech habits. The 
use of the scales as tools to measure anxiety 
related to such therapeutic phenomena as in- 
terpretation, reassurance, or transference, as 
well as for evaluation of therapeutic progress 
should be explored. 


Summary 


The derivation of two measures of transient 
or situational anxiety as revealed in speech 
has been described. The two uncorrelated 
measures are based upon a count of speech 
cues reflecting disorganization, emotional in- 
volvement, and defensiveness. The use of the 
scales in a study of anxiety in psychological 
interviews was described, and some implica- 


tions of the scales as anxiety measures were 
discussed. 


Received March 12, 1956. 
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Test Anxiety and Intelligence Test Performance 


I. Zweibelson 
Milford, Connecticut, Public Schools 


Relatively few persons have escaped the 
worry which is usually associated with the 
group tests which are used extensively in our 
culture. It is also true that there have been 
relatively few studies of the influence of fear, 
worry, and anxiety on group test perform- 
ance. Studies (3, 7) of the worries of ele- 
mentary school children have revealed that 
they worried most about test failure. It has 
also been shown (8) that college students 
with high anxiety performed noticeably more 
poorly on group intelligence tests than stu- 
dents with low anxiety. 


The Problem 


The present study * concerns the relation 
between children’s test worry or anxiety and 
their scores on group tests of mental ability 
which differed in content, format, and method 
of administration. The following question was 
posed: does performance in mental ability 
test situation x, y, or z differ in relation to 
the relative amount of test anxiety indicated 
by the subject? Table 1 briefly summarizes 
the important differences of the three men- 
tal ability tests used in the study: the Otis- 
Alpha, the Davis-Eells, and the Otis-Beta. 
An index of pupil test anxiety was secured 
by the investigator who read a list of 43 
questions * to 12 classes of fifth grade ele- 
mentary school children. Each child re- 
sponded on an answer sheet by circling a 


1 Adapted and condensed from a dissertation sub- 
mitted in partial fulfillment of the requirements for 
the Ed.D. degree at Teachers College, Columbia 
University, in 1955. Part of the data were collected 
in connection with the USPHS Project M-712 un- 
der the direction of Professor Seymour B. Sarason 
of Yale University. 

2 The questionnaire used was developed under the 
direction of Professor Sarason of Yale. 


“Yes” or “No” for each question. The ques- 
tions dealt specifically with fears, worries, 
and feelings in test and testlike situations. 


Hypotheses 


1. Given three tests of mental ability, the 
Davis-Eells, the Otis-Alpha, and the Otis- 
Beta, performance on the Davis-Eells and the 
Otis-Alpha tests should be less related to test 
anxiety than in the case of the Otis-Beta. 

2. In the case of the Otis-Beta and the 
Davis-Eells, the “High-Anxious” should do 
better on the mental ability test taken second 
than on the one taken first, regardless of the 
test, ie., the order in which the tests are 
taken will have an effect on the scores of the 
“High Anxious.” 

Design 

Five of nine elementary schools in Mil- 
ford, Connecticut, were selected as schools 
which served town areas thought to be repre- 
sentative of pupils in all of the town schools. 
All of the fourth grade children in the se- 
lected schools were given three mental abil- 
ity tests (1, 5, 6) and the Test Anxiety 
Questionnaire. The Otis-Alpha and the Ques- 
tionnaire were administered in the ninth 
month of the fourth grade school year. The 
Otis-Beta and the Davis-Eells were adminis- 
tered during the first month of the fifth grade 
school year. Half of the pupils were given 
the Otis-Beta first and half the Davis-Eells 
first—the total N was 258. There was ap- 
proximately a week between the administra- 
tion of the first and the second tests. 


Results 


The correlations among the three mental 
ability tests were: 
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Table 1 
Major Differences of Three Group Mental Ability Tests 


Provision for 


picture or symbol marked 


booklet form 


short period preceding 


Practice each of two sittings 
Time Limit 40-50 minutes 
Speed Factor emphasis tends to be on 


power although some 
don’t have time to try all 
of the items 


large space for mark next 
to test item 


booklet form 


substantial periods pre- 
ceding several sections 


100-120 minutes 


emphasis upon power, 
each pupil could attempt 
every item 


Ditference 
Factor Otis-Alpha Davis-Eells Otis-Beta 

Test Items variety of problems in- real-life problems, perso- school achievement 
clude geometric, pictorial, nal oriented 
and language ability items 

Style semiacademic semihumorous academic 

Method of semiformal test atmos- game atmosphere stressed formal test atmosphere 

Administration phere 
_ Instructions little provision for praise substantial provision for little provision for praise 

and reassurance praise and reassurance and reassurance 

Format free of reading free of reading reading required 
small, clear pictures large, clear pictures print 


small space on separate 
sheet for mark 


3 sheets fold like accor- 
dian 


short period preceding the 
one sitting 


30 minutes 


speed important, no pupil 
could attempt every item 


Otis-Alpha vs. Davis-Eells 59 
Otis-Alpha vs. Otis-Beta 55 
Davis-Eells vs. Otis-Beta 49 


These correlations are significant at the one 
per cent level of probability. 

The correlations among the tests and test- 
anxiety scores were: 


Otis-Alpha vs. Text Anxiety — .28 
Otis-Beta vs. Test Anxiety — .24 
Davis-Eells vs. Test Anxiety — .14 


These correlations are significant at the five 
per cent level. The significance of the differ- 
ences between the Test Anxiety and the in- 
telligence test correlations was found by the 
use of a formula developed by Hotelling and 
given by Walker and Lev (9, p. 257). In 


this formula the correlations among the tests 
are taken into account. The difference be- 
tween the Test Anxiety vs. Otis-Beta r of 
— .24 and the Test Anxiety vs. Davis-Eells 
r of — .14 was not found to be significant, 
although the results are in the predicted di- 
rection (p = .10). The difference between the 
Test Anxiety vs. Otis-Alpha r of — .28 and 
the Test Anxiety vs. Davis-Eells r of — .14 
was found to be significant at the five per 
cent level of probability. It appears that the 
first hypothesis receives support from these 
results although it is most clear in terms of 
the Otis-Alpha vs. Davis-Eells data. 

Table 2 shows that the “low” test-anxious 
make higher mental ability indices than the 
“high” test-anxious regardless of the order of 
test administration and regardless of the test 
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instrument. There is also a tendency for the 
“High Anxious” to do better on the test taken 
second than on the test taken first. However, 
an analysis of variance of the data reported 
in Table 2 shows no significant variation in 
the size of the difference between the Davis- 
Eells and Otis-Beta indices because of: (a) 
the effect of the extent of test anxiety; or 
(6) the effect of the order in which tests 
were taken; or (c) the effect of the interac- 
tion of these variables. It should be pointed 
out, however, that the order of test admini- 
tration was the greatest source of variation 
of the several sources investigated. The data 
in Table 2 clearly do not allow one to accept 
Hypothesis 2, although the fact that the 
trends are in the predicted direction suggests 
that consideration of other factors in future 
studies may give more positive results. It 
might be pointed out in this connection that 
in studies of test anxiety with college students 
(2, 4) and children there is evidence that the 
“High Anxious” subjects tend to improve 
rather markedly in the course of learning. 
Discussion 

The results of the present study support 
the position that test anxiety may be related 
to and effect group test results. There is also 
some support for the Davis and Eells con- 
tention that their test is less influenced by 
anxiety than other conventional tests. Evalua- 
tion-connected experiences which involve fail- 
ure or threat of failure may serve to engender 
personal reactions which interfere with test 
performance. The standardized group test 
situation may have threatening factors inad- 
vertently attached to the experience by the 
test-maker or the examiner. 

The anxiety-provoking role of the examiner 
has not been studied in this present investi- 
gation. The different examiners (teachers and 
psychologists) in this study may have influ- 
enced the results by provoking different anx- 
ious reactions in the children. It seems im- 
portant to draw attention to such possibili- 
ties and to suggest that the examiner’s role 


Table 2 


Mean and Variance of Mental Ability Test Scores of 
High and Low Test-Anxious Pupils Within 
Order of Test Administration 


(N = 33 in each group) 


Pupil test anxiety 


Lower 31% Upper 31% 
Order of administration xX s? xX s? 
Otis-Beta Administered 
First 
Otis-Beta Score 5.5 3.9 40 2.2 
Davis-Eells Score 5.4 38 44 2.4 
Davis-Eells Administered 
First 
Otis-Beta Score 39 4.2 
Davis-Eells Score 47 3.1 


in effecting group test performance might 
warrant investigation. 
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The Relation of the Children’s Manifest Anxiety 
Scale to the Concept of Anxiety as Used 
in the Clinic’ 


Robert D. Wirt and William E. Broen, Jr.’ 


University of Minnesota 


Considerable research has been published in 
recent years indicating relationships between 
a construct measured by the Taylor Manifest 
Anxiety Scale (MAS) and other variables 
mainly within the scope of learning theory. 
Taylor recently has reviewed this work (2). 
More recently Castaneda, McCandless, and 
Palmero have adapted the MAS for use with 
children (CMAS) and have reported research 
indicating that its relationship to other vari- 
ables approximates that of the adult form of 
the scale (1). 

Because of the importance of the anxiety 
construct to clinical work, we investigated 
the relationship between the CMAS and the 
concept of anxiety as used in the clinic. Using 
the CMAS we tested 505 children in the 4th, 
5th, and 6th grades in the Minneapolis Pub- 
lic Schools and 34 children from the same 
grades who had been referred to its Child 
Study Department. Predicting from the clini- 
cal concept of anxiety as we understand it, 
we hypothesized that the clinic children would 
score higher on the CMAS than the others. 
Analysis of variance failed to support this ex- 
pectation. We also predicted there would be 
a positive relationship between anxiety as 
rated by psychologists after diagnostic study 


2An extended report of this study may be ob- 
tained without charge from Robert D. Wirt, De- 
partment of Psychology, University of Minnesota, 
Minneapolis 14, Minnesota, or for a fee from the 
American Documentation Institute. To obtain it 
from the latter source, order Document No. 5038 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
ton 25, D. C., remitting in advance $1.25 for micro- 
film or $1.25 for photocopies. Make checks payable 
to Chief, Photoduplication Service, Library of Con- 


gress. 
2 Now at the University of Kansas. 


or as rated by teachers and scores on the 
CMAS (not verified); that intelligence would 
be inversely related to CMAS scores (not 
verified); and that there would be an in- 
verse relationship between CMAS and read- 
ing achievement scores with intelligence par- 
tialled out (significant at the .04 level). 

As a result of this research we feel that the 
CMAS (and the MAS insofar as it is com- 
parable) although undoubtedly a valuable 
and predictive instrument is measuring some- 
thing different than is denoted by the clini- 
cal concept of anxiety. We find no contradic- 
tion between this conclusion and work cited 
by Taylor (2) which shows a relationship 
between the MAS and other clinically used 
variables. We believe the common factor un- 
derlying the relationships to be one of “will- 
ingness to say deviant things about the self” 
or “test-taking attitude” which factor is not 
necessarily highly related to anxiety as the 
concept is clinically used. Studies showing sig- 
nificant relationships have emphasized mani- 
fest anxiety, as, of course, do the names of 
the scales. 

As has been pointed out by others, concepts 
such as anxiety need much more careful de- 
notation than they have yet been given. We 
are presently engaged in further research on 
the construct of anxiety with the hope of giv- 


ing it greater specificity. 


Brief Report. 
Received July 30, 1956. 
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The “Alienation Syndrome” and Adler's Concept 
of “Distance” 


H. L. Ansbacher 


University of Vermont 


In a paper published in this Journal and 
entitled “Alienation, social apperception, and 
ego structure,” Anthony Davids (2) reported 
an Affect Questionnaire which he constructed 
in collaboration with Henry A. Murray. The 
questionnaire measures eight critical dispo- 
sitions of personality: sociocentricity, opti- 
mism, and trust (three “positive” or socially 
desirable dispositions); and egocentricity, 
pessimism, distrust, anxiety, and resentment 
(five “negative” or socially undesirable dis- 
positions). Statistical analysis of results ob- 
tained with this questionnaire from Harvard 
students led to the formulation of a syndrome 
termed “alienation,” which is composed of 
the five negative dispositions. 

The purpose of the present note is to show 
that the concept of “alienation” bears close 
resemblance to Adler’s concept of “distance.” 
Etymologically, the two terms have in com- 
mon the idea of isolation from the group 
(like the sociologists’ term “anomie”), the 
first denoting the feeling of being a stranger, 
the second, the attitude of standing apart 
from the rest. 

Adler’s “distance” manifests itself in a 
fourfold modus of, figuratively, moving back- 
ward, standing still, hesitating, or construct- 
ing obstacles (1, pp. 273-276). All kinds of 
anxiety are seen as forms of distance behav- 
ior. To anxiety so described, Adler subordi- 
nates a number of specific feelings and atti- 
tudes which bear close similarity to Davids’ 
negative dispositions. In order to demonstrate 
this, we shall insert in the following quota- 
tion from Adler the corresponding descrip- 
tive terms from Davids. 
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Once a person has acquired the attitude of run- 
ning away from the difficulties of life, this attitude 
may be greatly strengthened and safeguarded by the 
addition of anxiety (Davids’ anxiety). ... It em- 
bitters (Davids’ resentment) his life to a marked de- 
gree, renders him quite unsuited for making con- 
tacts, and thus also for laying the basis for a peace- 
ful life and for fruitful accomplishments. ... We 
shall meet again the well-known type who feels him- 
self forced by necessity to think more of himself 
and consequently has little left for his fellow man 
(Davids’ egocentricity). As though in confirmation 
of this view one often finds that such people are in- 
clined to think of the past or of death, which has 
about the same effect... . Or they emphasize that 
all is vanity, after all, that life is short, and one can- 
not know what will happen (Davids’ pessimism)” 
(1, pp. 276-277). 


This quotation permitted the insertion of 
all of Davids’ negative dispositions except 
distrust, which is found in Adler under the 
heading of Oversensitivity, one of the char- 
acteristics of the neurotically disposed indi- 
vidual. “Distrust in his own strength which 
awakens doubt about everything; distrust in 
others which is socially disturbing and dis- 
rupts every group—these at times character- 
ize the oversensitive patient” (1, p. 290). 

With these similarities, it is understandable 
that the further findings of Davids are in ac- 
cordance with Adlerian theory. This is true 
especially for the finding “that individuals 
who view the world and their relationship to 
it in a negative (i.e., alienating) light have a 
weak ego structure (i.e., poor personal and 
social adjustment) as estimated by experi- 
enced clinical judgment” (2, p. 27). Adler 
had started with the observation that poor 
adjustment is characterized by the syndrome 
of distance which is the resultant of increased 
inferiority feelings and lack of social interest. 
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Davids developed the Affect Questionnaire 
to investigate relations between degree of 
alienation, social apperception, and ego struc- 
ture, meaning ego strength vs. ego weakness. 
Not only by the formulation of the concept 
of alienation, but also by this purpose, is 
Davids’ investigation in line with Adlerian 
theory. The relationship, according to Adler, 
would be: The maladjusted (alienated) indi- 
vidual is one who, due to increased inferior- 
ity feelings (ego weakness) and concomitant 
lack of social interest, apperceives the world 
as a hostile place. “Schema of apperception”’ 
is a term frequently employed by Adler. Spe- 
cifically, the content of the schema of apper- 
ception of the maladjusted would be: They 
“feel and live as though they were in an 
enemy country. Fighting, hesitating, stopping, 


H. L. Ansbacher 


escaping, much more occupied with their own 
persons than with others, they are therefore 
selfish, inconsiderate, lacking in social inter- 
est, courage, and self-confidence because they 
fear defeat more than they desire success” 
(1, p. 118). 

Altogether, the paper by Davids can well 
be taken as experimental support and exten- 
sion of Adlerian theory. 
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It is indeed gratifying when another in- 
vestigator is able to make use of one’s find- 
ings in building a bridge between empirical 
data and theory. This is especially true when 
the data were not gathered with a view to- 
ward supporting or refuting hypotheses de- 
rived from the particular theory with which 
they are eventually linked. The fact that the 
theory in question, that of Adlerian psychol- 
ogy, is not particularly new on the psycho- 
logical scene, coupled with the fact that this 
theory has been submitted to very little ex- 
perimental verification in controlled experi- 
ments, makes it seem all the more noteworthy 
that Ansbacher (2) was able to use our re- 
cent experimental evidence to support Adler’s 
theoretical statements. 

First, let me make a few comments on the 
dispositions which form the personality syn- 
drome of alienation. On numerous occasions 
I have been asked why we chose to work with 
these particular eight dispositions of person- 
ality. That is, why did we include just these 
eight dispositions in our researches? Murray 
and I were first faced with the problem of 
choice of attributes to study when we em- 
barked upon the task of constructing an au- 
ditory projective technique (5) several years 
ago. We wanted to include in this auditory 
test spoken statements and phrases indicative 
of certain critical dispositions of personality. 
By means of selective perception, apper- 
ception, accentuation, and retention of the 
spoken statements we hoped to secure a 
measure of the extent to which the perceiver 
possessed the various dispositions in his own 


personality. In our original pilot studies we 
experimented with many variables in addi- 
tion to those which have now come to be re- 
garded as forming an “alienation syndrome.”’ 
For example, we employed other dispositions 
or attributes, such as self-pity, gratitude, ac- 
tivity, passivity, and several others. 

On the basis of these pilot studies, we 
found that it was difficult to differentiate 
sharply among certain of the dispositions. 
That is, we found that both in writing state- 
ments to represent certain dispositions and 
in attempting to score subjects’ responses, 
certain dispositions tended to overlap and 
some seemed not to be assessed with as much 
validity as others. Consequently, on the basis 
of practical considerations and preliminary 
studies, we decided to focus on the eight dis- 
positions which are currently measured by 
both the auditory projective technique (5) 
and the Affect Questionnaire (4). 

It should be pointed out, however, that de- 
ciding upon these dispositions was not merely 
an empirical or a theoretical development. 
These dispositions were proposed by Murray 
as being of critical importance based on his 
theory of personality dynamics and on his 
years of intensive investigation of normal and 
abnormal personalities. If one looks into Mur- 
ray’s contributions to the field of personality 
assessment (e.g., 7, 8, 9) it is evident that 
these eight dispositions, and several others, 
have long played a prominent role in his 
theorizing and his empirical investigations. 
Thus, the selection of these eight dispositions 
for study was based upon Murray’s wisdom, 
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experience, and clinical knowledge about traits 
or dispositions that are of critical importance 
in the dynamics of personality. 

Turning now to consideration of the Affect 
Questionnaire, it was constructed originally 
as an instrument against which to validate 
findings derived from the auditory projective 
test. This technique attempts to assess the 
relative strength of the eight dispositions in 
subjects’ personalities by having them indi- 
cate their degree of agreement or disagree- 
ment with statements designed to represent 
the various dispositions. It was found that 
two experienced clinical psychologists could 
agree perfectly in their classification of the 
statements as indicative of the different dis- 
positions. When the Affect Questionnaire 
(which consists of 10 statements representa- 
tive of each of the eight dispositions, mak- 
ing a total of 80 items) was administered 
to groups of subjects, it was found that 
there were highly significant intercorrelations 
among the five negative dispositions, and that 
scores on these dispositions were negatively 
correlated with scores on each of the three 
positive dispositions. Thus, on the basis of 
these empirical findings (4), which were sup- 
ported by unpublished findings obtained with 
additional groups of both normal and mal- 
adjusted subjects, we labeled the interrelated 
negative dispositions as forming an “aliena- 
tion syndrome.” 

From the brief description of the selection 
of dispositions for study and the empirical 
findings which led to formulation of the syn- 
drome, it is quite evident that the link with, 
or similarity to, Adler’s concept of “distance” 
was not readily apparent to us, nor were our 
researches motivated by Adlerian theory. 
Thus, it seems quite striking that our em- 
pirical findings fit so neatly with Adler’s 
theoretical formulations, and that these dis- 
positions do, in fact, tend to coexist in the 
personality just as Adler believed on the ba- 
sis of his theoretical convictions and his clini- 
cal experience. 

Today much research attention is being de- 
voted to submitting to experimental test many 
of the theoretical notions put forth by Freud 
on the basis of his clinical experience. How- 
ever, this doe not seem to be the case with 
Adlerian theory. It is, therefore, especially 


interesting to find that an experiment such 
as ours, which fits into the currently popular 
area of research termed “social perception” 
and which uses experimental techniques and 
statistical procedures, fits so nicely with hy- 
potheses that would be generated by Adlerian 
theory. In other words, if we had taken as 
our point of departure the theoretical notions 
and statements advanced by Adler, and had 
designed our researches specifically to test 
propositions derived from his theory, we 
would have confirmed them in a well-con- 
trolled laboratory investigation employing 
modern statistical procedures. Since there is, 
undoubtedly, much of value in Adler’s theory 
(3), and since many features or derivatives 
of Adlerian theory are put to use in present- 
day clinical practice (1, 6, 10), it seems of 
real value that Ansbacher was able to per- 
ceive the link between our seemingly unre- 
lated experiment and Adler’s theoretical con- 
cepts. 
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Blum, Gerald S. The Blacky Pictures. New York: 
Psychological Corp., 1950. 


Reviewed by 


Samuel J. Beck 
University of Chicago 


A homely, obstreperous, and by now not unloved 
canine, who plays the titlke—and perhaps a hero’s— 
role, this is Blacky of the Blacky test. Sired by psy- 
choanalysis out of TAT he was created ad hoc: to 
test out psychoanalytic theory concerning psycho- 
sexual development, Videlicet, the title of Blum’s 
monograph (2). It is a bold ingress into that most 
fluid area of psychology—depth psychology—which 
Blum undertakes, one more specific in purpose and 
design than those which projective tests have so far 
essayed. These, pioneers that they are, opening up a 
totally new field, have been engaged in attempting 
some broad estimate of the new territory, apprais- 
ing its potential yield in terms of fresh knowledge 
concerning the human personality or, at the least, 
fresh ways of studying it. They have been treading 
warily, with some trembling at the shapes and sights 
unholy lurking in the scientific jungle that surrounds 
the regions of the human personality. However, as at 
other frontiers, the pioneers are followed by the more 
intensive cultivators. These latter can stake out, each 
his special acre, and so a method like that of the 
projective tests advances, by new experiment. 

Blum has devised a test ad hoc for psychoanalytic 
theory by the ingenious technique which activates 
certain of the psychoanalytically formulated uncon- 
scious traits, at the same time that it provides the 
defenses enabling the patient to liberate and com- 
municate these forbidden trends—the defenses of 
identification and displacing. I am not using any of 
the space allotted to me for this review in describ- 
ing the test. He who has not yet made the acquaint- 
ance of the very human Blacky is referred to his 
creator’s monograph (2). In carrying on his experi- 
ment, Blum used, as his point of departure, thirteen 
“dimensions” in psychoanalytic theory. Anyone even 
superficially read in this field will recognize the 
terms. His results, in University of California stu- 
dents, showed statistically significant differences be- 
tween the sexes, statistically significant relationships 
between dimensions, and evidence in the writings of 


Freud and of Fernichel (he quotes the text from 
both) showing an impressive consistency between 
Blum’s significant findings and psychoanalytic the- 
ory. He arrives at the cautious conclusion that “some 
aspects of psychoanalytic theory have demonstrable 
validity.” 

In the few years since he published, his test and 
findings have been subjected to fairly extensive re- 
search, not surprising what with the dominance of 
both psychoanalysis and projective instruments on 
the current psychological scene. Within a year, Sew- 
ard criticized the exclusion of statistically insignifi- 
cant but relevant data in Blum’s report, and also 
pointed to some likely logical flaws (11). Michal- 
Smith, Hammer, and Spitz found their Blacky test 
analysis consistent with indication of Oedipal inten- 
sity in a nine-year-old boy (8). Paranoids were 
studied by Aronson (1) and he reported partial con- 
firmation of Blum’s results. Other investigations that 
have come to my attention (I am making no pre- 
tense here to an exhaustive survey of the literature) 
are: Blum and Kaufman (4) in 14 male patients 
with peptic ulcer; McNeil and Blum (7) relating 
handwriting signs with the psychosexual dimensions ; 
and Ellis (6), who, in a patient with some two hun- 
dred hours of psychoanalysis, had the test rated by 
22 other psychologists. In a survey published in 1952 
by Blum and Hunt (3), they reported also an in- 
vestigation in stutterers (Merchant); sexual offend- 
ers (Lindner) ; behavior in small groups (Swanson). 
A more recent investigation is that of Molish, Lyon, 
and Briggs (9), who compared Blacky test findings 
in 1947 naval recruits found “not maladjusted” by 
the Navy’s neuropsychiatry service, with 390 re- 
cruits who were rejected as unsuitable. These in- 
vestigators, while alert to the psychoanalytic impli- 
cations of their results, were especially interested in 
the test’s potency for differentiating their two groups. 
An inference from their research is that, whatever 
the agreements with psychoanalytic theory, the 
Blacky has its value in practical application, as de- 
rived from empirical data. 

The test has thus a differentiating potency, and 
without a doubt it has the projective instrument 
value of opening a window to latent character traits. 
At the same time results are suspect, as I see it, by 
reason of a major fallacy in the technique of treat- 
ing the data. This consists in the method of “scor- 
ing” each story and the other criteria as “strong” 
and the various gradations of “strong.” The judg- 
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ments are made by each examiner, without recourse 
to any frame of reference. Scorings must, there- 
fore, be a function of each scorer. They can, I will 
concede, be made reliable as between two investi- 
gators who have worked together closely and have 
learned the same signs and language, in recognizing 
and naming the psychological traits or “dimensions.” 
Except between two such colleagues, or for some 
one examiner, results are only spuriously compa- 
rable. The degree of error to which the procedure is 
subject is seen in Blum’s own monograph: a denial 
of a response is construed as a repression, and since 
a repression involves strong emotion, the response is 
scored “strong” (2, p. 34). To construe denial as re- 
pression—which is a hypothesis and not a datum— 
is interpretation. Thus a quantitative profile is de- 
veloped and is reported as S’s production although it 
is actually the examiner’s interpretation of S’s pro- 
duction. 

This criticism touches on what is the Achilles’ heel 
of each projective test. It dictates the need, there- 
fore, of exploiting the objectivity potential in the 
test. The needed technique is that of patterning out 
its data into impersonal structure to be described 
operationally, and so creating some stable frames of 
reference by which the productions of any S can be 
judged, if not measured. Some of the projective tests 
do lend themselves more easily to such impersonal 
structuralization than others. But to the extent that 
the instrument falls short of such objectivity, it is 
not a test but a two-person situation, two persons— 
S and examiner, each of whom may influence the re- 
sults. Schafer (10) has been especially emphatic con- 
cerning the interpersonal relation between examiner 
and subject, and the possible role of the examiner’s 
personality in fashioning the final test picture. Bruns- 
wik writing from another theoretical viewpoint, ar- 
rives at a similar conclusion. “Since in any testing 
procedure the examiner constitutes part of the ex- 
ternal stimulus situation, representative design de- 
mands that examiners should also be sampled,” and 
he refers to the “double jeopardy of generalization” 
(5, p. 131). 

An important implication follows from these po- 
sitions of Schafer and of Brunswik: projective test 
investigations will need to find some method of 
keeping the examiner factor constant, if reports by 
two or more persons are to be comparable. The 
problem becomes at this point the baby of the theo- 
rists and logicians of science. This circumstance does 
not, however, ease the burden of the psychologists 
searching personality by means of projective tests. 


The fact is they have so far been arriving at right 
answers, even if they do not know how. Their an- 
swers are bound to be that much more accurate as 
they are based in sound method, and as they take 
pains to devise such. This is where I see Blacky and 
his growing pains at the present. The test has the 
promise warranting its developing. In spite of the 
fallacy in the scoring procedure, its agreement with 
clinical theory is logically sound. It can serve, fur- 
ther, as an example of other ad hoc techniques which 
can be devised to explore psychoanalytic theory. 
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